Chapter 1

Introduction and Motivation 



Throughout, unless said otherwise, we will work in propositional logic.



1.1 Program 

The human agent in his daily activity has to deal with many situations involving change. Chief among them are the
following:

(1) Common sense reasoning from available data. This involves prediction of what unavailable data is supposed to
be (nonmonotonic deduction), but it is a defeasible prediction, geared towards immediate change. This is formally
known as nonmonotonic reasoning and is studied by the nonmonotonic community.

(2) Belief revision, studied by a very large community. The agent is unhappy with the totality of his beliefs which he 
finds internally unacceptable (usually logically inconsistent but not necessarily so) and needs to change/revise it. 

(3) Receiving and updating his data, studied by the update community. 

(4) Making morally correct decisions, studied by the deontic logic community. 

(5) Dealing with hypothetical and counterfactual situations. This is studied by a large community of philosophers and 
AI researchers. 

(6) Considering temporal future possibilities. This is covered by modal and temporal logic.

(7) Dealing with properties that persist through time in the near future and with reasoning that is constructive. This is 
covered by intuitionistic logic. 

All the above types of reasoning exist in the human mind and are used continuously and coherently every hour of the 
day. The formal modelling of these types is done by diverse communities which are largely distinct with no significant 
communication or cooperation. The formal models they use are very similar and arise from a more general theory, what 
we might call: 

"Reasoning with information bearing binary relations" . 



1.2 Short overview of the different logics 

We will discuss the semantics of the propositional logic situation only. 

In all cases except the last two (i.e. Inheritance and Argumentation), the semantics consist of a set of classical models for 
the underlying language, with an additional structure, usually a binary relation (sometimes relative to a point of origin). 
This additional structure is not unique, and the result of the reasoning based on this additional structure will largely 
depend on the specific choice of this structure. The laws which are usually provided (as axioms or rationality postulates) 
are those which hold for any such additional structure. 



1.2.1 Nonmonotonic logics 

Nonmonotonic logics (NML) were created to deal with principled reasoning about "normal" situations. Thus, "normal"
birds will (be able to) fly, but there are many exceptions, like penguins, roasted chickens, etc., and it is usually difficult to 
enumerate all exceptions, so they will be treated in bulk as "abnormal" birds. 

The standard example is - as we began to describe already - that "normal" birds will (be able to) fly, that there are 
exceptions, like penguins, that "normal" penguins will not fly, but that there might be exceptions to the exceptions, that 
some abnormal penguin might be able to fly - due to a jet pack on its back, some genetic tampering, etc. Then, if we know 
that some animal is a bird, call it "Tweety" as usual, and if we want to keep it as a pet, we should make sure that its cage 







Note that this reasoning is nonmonotonic: From the fact "Tweety is a bird" , we conclude that it will (normally) fly, but 
from the facts that "Tweety is a bird" and "Tweety is a penguin" , we will not conclude that it will (normally) fly any 
more, we will even conclude the contrary, that it will (normally) not fly. 

We can also see here a general principle at work: more specific information (Tweety is a penguin) and its consequences 
(Tweety will not fly) will usually be considered more reliable than the more general information (Tweety is a bird) and its 
consequences (Tweety will fly). Then, NML can also be considered as a principled treatment of information of different 
quality or reliability. The classical information is the best one, and the conjecture that the case at hand is a normal one 
is less reliable. 

Note that normality is absolute here in the following sense: normal birds will be normal with respect to all "normal" 
properties of birds, i.e. they will fly, lay eggs, build nests, etc. In this treatment, there are no birds normal with respect 
to flying, but not laying eggs, etc. 

It is sometimes useful to introduce a generalized quantifier $\nabla$. In a first order (FOL) setting, $\nabla x \phi(x)$ will mean that $\phi(x)$
holds almost everywhere; in a propositional setting, $\nabla \phi$ will mean that $\phi$ holds in almost all models. Of course, this "almost
everywhere" or "almost all" has to be made precise, e.g. by a filter over the FOL universe, or the set of all propositional
models.

Inheritance systems will be discussed separately below. 

• Formal semantics by preferential systems 

The semantics for preferential logics are preferential structures, a set of classical models with an arbitrary binary
relation. This relation need not be transitive, nor does it need to have any other of the usual properties. If $m \prec m'$,
then $m$ is considered more normal (or less abnormal) than $m'$. $m$ is said to be minimal in a set of models $M$ iff there
is no $m' \in M$ with $m' \prec m$. A word of warning: there might be $m' \prec m$, but $m' \notin M$.

This defines a semantic consequence relation as follows: we say $\phi \mathrel{|\!\sim} \psi$ iff $\psi$ holds in all minimal models of $\phi$.

As a model $m$ might be minimal in $M(\phi)$ - the set of models of $\phi$ - but not minimal in $M(\psi)$, where $\models \phi \to \psi$
classically, this consequence relation $\mathrel{|\!\sim}$ is nonmonotonic. Non-flying penguins are normal (= minimally abnormal)
penguins, but all non-flying birds are abnormal birds.
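
The definition of the preferential consequence relation can be tried out on the Tweety example. The following is only a minimal sketch: the atoms, the numeric `abnormality` ranking and the helper names are assumptions made for illustration, and a ranking is just one convenient way to obtain a relation; the definition itself allows an arbitrary binary relation.

```python
from itertools import product

ATOMS = ("bird", "penguin", "flies")

def all_models():
    """All classical models over ATOMS, as dicts atom -> bool."""
    return [dict(zip(ATOMS, vals))
            for vals in product([False, True], repeat=len(ATOMS))]

def abnormality(m):
    """Hypothetical ranking (an assumption of this sketch): lower is more normal."""
    if m["penguin"]:
        return 1 if not m["flies"] else 2   # normal penguins do not fly
    if m["bird"]:
        return 0 if m["flies"] else 1       # normal birds fly
    return 0

def prec(m1, m2):
    """m1 is strictly more normal than m2."""
    return abnormality(m1) < abnormality(m2)

def minimal(models):
    """Elements with no strictly more normal element inside the same set."""
    return [m for m in models if not any(prec(n, m) for n in models)]

def entails(phi, psi):
    """phi |~ psi: psi holds in all minimal models of phi."""
    phi_models = [m for m in all_models() if phi(m)]
    return all(psi(m) for m in minimal(phi_models))

bird = lambda m: m["bird"]
bird_and_penguin = lambda m: m["bird"] and m["penguin"]
flies = lambda m: m["flies"]

print(entails(bird, flies))              # birds normally fly
print(entails(bird_and_penguin, flies))  # strengthening the premise loses the conclusion
```

Under this assumed ranking, strengthening the premise from "bird" to "bird and penguin" withdraws the conclusion "flies", which is exactly the failure of monotony described above.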

Minimal models of $\phi$ need not exist, even if $\phi$ is consistent - there might be cycles or infinite descending chains. We
will write $M(\phi)$ for the set of $\phi$-models, and $\mu(\phi)$ or $\mu(M(\phi))$ for the set of minimal models of $\phi$. If there is some
set $X$ and some $x' \in X$ s.t. $x' \prec x$, we say that $x'$ minimizes $x$, likewise that $X$ minimizes $x$. We will be more precise
in Chapter 3 (page 55).

One can impose various restrictions on $\prec$; they will sometimes change the resulting logic. The most important one
is perhaps rankedness: If $m$ and $m'$ are $\prec$-incomparable, then for all $m''$: $m'' \prec m$ iff $m'' \prec m'$, and also $m \prec m''$ iff
$m' \prec m''$. We can interpret the fact that $m$ and $m'$ are $\prec$-incomparable by putting them at the same distance from
some imaginary point of maximal normality. Thus, if $m$ is closer to this point than $m''$ is, then so will be $m'$, and if
$m$ is farther away from this point than $m''$ is, then so will be $m'$. (The condition of smoothness, which is also very important,
is more complicated, so the reader is referred to Chapter 3 (page 55) for discussion.)

Preferential structures are presented and discussed in Chapter 2] (page . 



1.2.2 Theory revision 

The problem of Theory Revision is to "integrate" some new information $\phi$ into an old body of knowledge $K$ such that the
result is consistent, even if $K$ together with $\phi$ (i.e. the union $K \cup \{\phi\}$) is inconsistent. (We will assume that $K$ and $\phi$ are
consistent separately.)

The best examined approach was first published in [AGM85], and is known by the initials of its authors as the AGM
approach. The formal presentation of this approach (and more) is in Chapter 8.2 (page 158).

This problem is well known in juridical thinking, where a new law might be inconsistent with the old set of laws, and the 
task is to "throw away" enough, but not too many, of the old laws, so we can incorporate the new law into the old system 
in a consistent way. 

We can take up the example for NML, and modify it slightly. Suppose our background theory K is that birds fly, in the 
form: Blackbirds fly, ravens fly, penguins fly, robins fly, . . . ., and that the new information is that penguins don't fly. Then, 
of course, the minimal change to the old theory is to delete the information that penguins fly, and replace it with the new 
information. 

Often, however, the situation is not so simple. $K$ might be that $\psi$ holds, and so does $\psi \to \rho$. The new information might
be that $\neg\rho$ holds. The radical - and usually excessive - modification will be to delete all information from $K$, and just take
the new information. More careful modifications will be to delete either $\psi$ or $\psi \to \rho$, but not both. But there is a decision
problem here: which of the two do we throw out? Logic alone cannot tell us, and we will need more information to take
this decision.

• Formal semantics 

In many cases, revising $K$ by $\phi$ is required to contain $\phi$; thus, if $*$ denotes the revision operation, then $K * \phi \vdash \phi$
(classically). Dropping this requirement does not change the underlying mechanism enormously; we will uphold it.

Speaking semantically, $K * \phi$ will then be defined by some subset of $M(\phi)$. If we choose all of $M(\phi)$, then any influence
of $K$ is forgotten. A good way to capture this influence seems to be to choose those models of $\phi$ which are closest to the
$K$-models, in some way, and with respect to some distance $d$. We thus choose those $\phi$-models $m$ such that there is
$n \in M(K)$ with $d(n, m)$ minimal among all $d(n', m')$, $n' \in M(K)$, $m' \in M(\phi)$. (We assume again that the minimal
distance exists, i.e. that there are no infinite descending chains of distances, without any minimum.)

This semantic approach corresponds well to the classical, syntactic AGM revision approach in the following sense:
When we fix $K$, this semantics corresponds exactly to the AGM postulates (which leave $K$ fixed). When we allow
$K$ to change, we can also treat iterated revision, i.e. something like $(K * \phi) * \psi$, and thus go beyond the AGM approach
(but pay the price of arbitrarily long axioms). This semantics leaves the order (or distance) untouched, and is thus
fundamentally different from e.g. Spohn's Ordinal Conditional Functions.
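
Distance based revision can be sketched in a few lines. The text leaves the distance open, so the Hamming distance between valuations used below is an assumption, as are the two atoms and the function names.

```python
from itertools import product

ATOMS = ("p", "q")

def models_of(formula):
    """All valuations over ATOMS (tuples of bools) satisfying `formula`."""
    return [v for v in product([False, True], repeat=len(ATOMS))
            if formula(dict(zip(ATOMS, v)))]

def hamming(n, m):
    """Assumed distance d: number of atoms on which two valuations differ."""
    return sum(a != b for a, b in zip(n, m))

def revise(k_models, phi_models, d=hamming):
    """Models of phi at globally minimal distance from some model of K."""
    if not k_models or not phi_models:
        return list(phi_models)
    best = min(d(n, m) for n in k_models for m in phi_models)
    return [m for m in phi_models if any(d(n, m) == best for n in k_models)]

K = models_of(lambda v: v["p"] and v["q"])   # old knowledge: p and q
PHI = models_of(lambda v: not v["p"])        # new information: not p
print(revise(K, PHI))                        # the closest model keeps q
```

With this distance, revising "p and q" by "not p" keeps q, since the valuation that only flips p is closer to the old model than the one that flips both atoms. A different distance may give a different result.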

1.2.3 Theory update 

Theory Update is the problem of "guessing" the results of developments that are, in some way, optimal.

Consider the following situation: There is a house, at time 0, the light is on, and so is the deep freezer. At time 1, the light 
is off. Problem: Is the deep freezer still on? The probably correct answer depends on circumstances. Suppose in situation 
A, there is someone in the house, and weather conditions are normal. In situation B, there is no one in the house, and 
there is a very heavy thunderstorm going on. Then, in situation A, we will conjecture that the person(s) in the house 
have switched the light off, but left the deep freezer on. In situation B, we might conjecture a general power failure, and 
that the deep freezer is now off, too. 

We can describe the states at time 0 and 1 by a triple: light on/off, freezer on/off, power failure yes/no.

In situation A, we will consider the development (light on, freezer on, no power failure) to (light off, freezer on, no power 
failure) as the most likely (or normal) one. 

In situation B we will consider the development (light on, freezer on, no power failure) to (light off, freezer off, power 
failure) as the most likely (or normal) one. 

Often, we will assume a general principle of inertia: things stay the way they are, unless they are forced to change.
Thus, when the power failure is repaired, freezer and light will go on again. 

• Formal semantics 

In the general case, we will consider a set of fixed length sequences of classical models, say $m = \langle m_0, m_1, \ldots, m_n \rangle$,
which represent developments considered possible. Among this set, we have some relation $\prec$, which is supposed to
single out the most natural, or probable ones. We then look at some coordinate, say $i$, and try to find the most
probable situation at coordinate $i$. For example, we have a set $S$ of sequences, and look at the theory defined by the
information at coordinate $i$ of the most probable sequences of $S$: $Th(\{m_i : m \in \mu(S)\})$ - where $\mu(S)$ is the set of the
most probable sequences of $S$, and $Th(X)$ is the set of formulas which hold in all models $x \in X$.
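Looking back at our above intuitive example, $S$ will be the set of sequences consisting of

$\langle (l, f, \neg p), (l, f, p) \rangle$,
$\langle (l, f, \neg p), (l, f, \neg p) \rangle$,
$\langle (l, f, \neg p), (l, \neg f, p) \rangle$,
$\langle (l, f, \neg p), (l, \neg f, \neg p) \rangle$,
$\langle (l, f, \neg p), (\neg l, f, p) \rangle$,
$\langle (l, f, \neg p), (\neg l, f, \neg p) \rangle$,
$\langle (l, f, \neg p), (\neg l, \neg f, p) \rangle$,
$\langle (l, f, \neg p), (\neg l, \neg f, \neg p) \rangle$,

where "l" stands for "light on", "f" for "freezer on", "p" for "power failure", etc. The "best" sequence in situation A
will be $\langle (l, f, \neg p), (\neg l, f, \neg p) \rangle$, and in situation B $\langle (l, f, \neg p), (\neg l, \neg f, p) \rangle$. Thus, in situation A, the result is defined
by $\neg l$, $f$, $\neg p$, etc. - the theory of the second coordinate.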

Thus, again, the choice of the actual distance has an enormous influence on the outcome.
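
The house example can be played through as a small sketch. The two cost functions below are purely hypothetical stand-ins for the relation that singles out the most probable sequences, and restricting attention to sequences whose second state has the light off is likewise an assumption of this illustration (it encodes the observation made at time 1).

```python
from itertools import product

# A state is a triple (light on, freezer on, power failure).
START = (True, True, False)
SEQUENCES = [(START, end) for end in product([True, False], repeat=3)]

def cost_a(seq):
    """Hypothetical plausibility in situation A: a power failure is very
    unlikely; otherwise inertia prefers few changes of light and freezer."""
    start, end = seq
    changes = sum(a != b for a, b in zip(start[:2], end[:2]))
    return 10 * end[2] + changes

def cost_b(seq):
    """Hypothetical plausibility in situation B: a power failure is expected,
    and during a failure neither light nor freezer can be on."""
    light, freezer, failure = seq[1]
    impossible = failure and (light or freezer)
    return 10 * (not failure) + 100 * impossible

def best_at(sequences, cost, i, observed=lambda state: True):
    """States at coordinate i of the most plausible admissible sequences."""
    admissible = [s for s in sequences if observed(s[i])]
    best = min(cost(s) for s in admissible)
    return {s[i] for s in admissible if cost(s) == best}

light_off = lambda state: not state[0]
print(best_at(SEQUENCES, cost_a, 1, light_off))  # freezer still on, no failure
print(best_at(SEQUENCES, cost_b, 1, light_off))  # freezer off, power failure
```

The same set of sequences yields different conjectures about the freezer, depending only on which plausibility measure is chosen.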



1.2.4 Deontic logic 

Deontic logic treats (among other things) the moral acceptability of situations or acts. 

For instance, when driving a car, you should not cause accidents and hurt someone. So, in all "good" driving situations, 
there are no accidents and no victims. Yet, accidents unfortunately happen. And if you have caused an accident, you 
should stop and help the possibly injured. Thus, in the "morally bad" situations where you have caused an accident, the 
morally best situations are those where you help the victims, if there are any. 

The parallel to the above example for NML is obvious, and, as a matter of fact, the first preferential semantics was given for
deontic, and not for nonmonotonic logics - see Section 7.1 (page 123).

There is, however, an important difference to be made. Preferential structures for NML describe what holds in the normally 
best models, those for deontic logic what holds in "morally" best models. But obligations are not supposed to say what 
holds in the morally best worlds, but are supposed to distinguish in some way the "good" from the "bad" models. This
problem is discussed in extenso in Section 7.1 (page 123).

• Formal semantics 

As said already, preferential structures as defined above for NML were given as a semantics for deontic logics, before 
NML came into existence. 

A word of warning: Here, the morally optimal models describe "good" situations, and not directly actions to take. 
This is already obvious by the law of weakening, which holds for all such structures: If $\phi$ holds in all minimal models,
and $\vdash \phi \to \psi$ (classically), then so does $\psi$. But if one should be kind, then it does not follow that one should be kind
or kill one's grandmother.




1.2.5 Counterfactual conditionals

A counterfactual conditional states an implication, where the antecedent is (at the moment, at least) wrong. "If it were to
rain, he would open his umbrella." This is comprehensible, the person has an umbrella, and if it were to start to rain now, 
he would open the umbrella. If, however, the rain would fall in the midst of a hurricane, then opening the umbrella would 
only lead to its destruction. Thus, if, at the moment the sentence was uttered, there was no hurricane, and no hurricane 
announced, then the speaker was referring to a situation which was different from the present situation only in so far as it 
is raining, or, in other words, minimally different from the actual situation, but with rain falling. If, however, there was a 
hurricane in sight at the moment of uttering the sentence, we might doubt the speaker's good sense, and point the problem
out to him/her. We see here again a reasoning about minimal change, or normal situations. 

• Formal semantics 

Stalnaker and Lewis first gave a minimal distance semantics in the following way: 

If we are in the actual situation $m$, then $\phi > \psi$ (read: if $\phi$ were the case, then $\psi$ would also hold) holds in $m$ iff in all
$\phi$-models which are closest to $m$, $\psi$ also holds. Thus, there might well be $\phi$-models where $\psi$ fails, but these are not
among the $\phi$-models closest to $m$. The distance will, of course, express the difference between the situation $m$ and
the models considered. Thus, in the first scenario, situations where it rains and there is no extreme wind condition 
are closer to the original one than those where a hurricane blows. 

In the original approach, distances from each possible actual situation are completely independent. It can, however,
be shown that we can achieve the same results with one uniform distance over the whole structure, see [SM94].
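
The minimal distance reading of the counterfactual can be sketched on the umbrella example. Everything specific here is an assumption for illustration: the three atoms, the background law in `plausible` (the umbrella is open exactly when it rains without a hurricane), and the distance, which counts differences in rain and hurricane only.

```python
from itertools import product

# A world is a triple (rain, hurricane, umbrella_open).
WORLDS = list(product([False, True], repeat=3))

def plausible(w):
    """Assumed background law of this toy model."""
    rain, hurricane, umbrella_open = w
    return umbrella_open == (rain and not hurricane)

def distance(m, w):
    """Assumed distance: differences in the circumstances rain and hurricane."""
    return sum(a != b for a, b in zip(m[:2], w[:2]))

def counterfactual(m, phi, psi):
    """phi > psi holds in m iff psi holds in all phi-worlds closest to m."""
    candidates = [w for w in WORLDS if plausible(w) and phi(w)]
    if not candidates:
        return True  # vacuously true when there are no phi-worlds
    best = min(distance(m, w) for w in candidates)
    return all(psi(w) for w in candidates if distance(m, w) == best)

rain = lambda w: w[0]
umbrella_open = lambda w: w[2]

calm_day = (False, False, False)
hurricane_day = (False, True, False)
print(counterfactual(calm_day, rain, umbrella_open))
print(counterfactual(hurricane_day, rain, umbrella_open))
```

Seen from the calm day, the closest rainy world has no hurricane and the conditional holds; seen from the day with a hurricane in sight, the closest rainy world still has the hurricane and the conditional fails.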



1.2.6 Modal logic 

Modal logic reasons about the possible or necessary. If we are in the midwest of the US, and it is a hurricane season, then 
the beautiful sunny weather might turn into a hurricane over the next hours. Thus, the weather need not necessarily stay 
the way it is, but it might become a very difficult situation. Note that we reason here not about what is likely, or normal, 
but about what is considered possible at all. We are not concerned only about what might happen in time t + 1, but 
about what might happen in some (foreseeable, reasonable) future - and not about what will be the case at the end of the 
developments considered possible either. Just everything which might be the case some time in the near future. 

"Necessary" and "possible" are dual: if $\phi$ is necessarily the case, this means that it will always hold in all situations evolving
from the actual situation, and if $\phi$ is possibly the case, this means that it is not necessary that $\neg\phi$ holds, i.e. there is at
least some situation into which the present can evolve and where $\phi$ holds.

• Formal semantics 

Kripke gave a semantics for Modal Logic by possible worlds, i.e. a set of classical models, with an additional binary 
relation, expressing accessibility. 

If $m$ is in relation $R$ with $n$, $mRn$, then $m$ can possibly become $n$, $n$ is a possibility seen from $m$, or whatever one might
want to say. Again, $R$ can be any binary relation. The necessity operator is essentially a universal quantifier: $\Box\phi$
holds in $m$ iff $\phi$ holds in all $n$ accessible via $R$ from $m$. Likewise, the possibility operator is an existential quantifier:
$\Diamond\phi$ holds in $m$ iff there is at least one $n$ accessible from $m$ where $\phi$ holds.

Again, it is interesting to impose additional postulates on the relation R, like reflexivity, transitivity, etc. 
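
The two truth conditions translate directly into code. In this minimal sketch the worlds and the accessibility relation `R` are made up for illustration; only the definitions of `box` and `diamond` follow the text.

```python
# Hypothetical accessibility relation: world -> set of worlds reachable from it.
R = {
    "sunny": {"sunny", "hurricane"},
    "hurricane": {"hurricane", "sunny"},
    "winter": {"winter"},
}

def box(R, holds, m):
    """Necessity: phi holds in every n accessible from m."""
    return all(holds(n) for n in R.get(m, ()))

def diamond(R, holds, m):
    """Possibility: phi holds in at least one n accessible from m."""
    return any(holds(n) for n in R.get(m, ()))

is_sunny = lambda w: w == "sunny"
is_hurricane = lambda w: w == "hurricane"

print(box(R, is_sunny, "sunny"))          # the weather need not stay sunny
print(diamond(R, is_hurricane, "sunny"))  # a hurricane is possible

def dual(holds, m):
    """Check the duality: diamond phi iff not box not phi, at world m."""
    return diamond(R, holds, m) == (not box(R, lambda n: not holds(n), m))
```

The helper `dual` checks, world by world, the duality of "necessary" and "possible".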



1.2.7 Intuitionistic logic 

Intuitionistic Logic is (leaving philosophy aside) reasoning about performed constructions and proofs in mathematics, or 
development of (certain) knowledge. We may have a conjecture - or, simply, any statement -, and a proof for it, a proof for 
its contrary, or neither. Proofs are supposed to be correct, so what is considered a proof will stay one forever. Knowledge 
can only be won, but not lost. If we have neither a proof nor a refutation, then we might one day have one or the other, 
or we might stay ignorant forever. 

• Formal semantics 

Intuitionistic Logic can also be given a semantics in the style of the one for Modal Logics. There are two, equivalent, 
variants. The one closer to Modal Logic interprets intuitionistic statements (in the above sense that a construction 
or proof has been performed) as preceded by the necessity quantifier. Thus, it is possible that in $m$ neither $\Box\phi$ nor
$\Box\neg\phi$ holds, as we have neither a proof for $\phi$, nor for its contrary, and might well find one for one or the other in some
future, possible, situation. Progressing along $R$ may, if the relation is transitive, only make more statements of the
form $\Box\phi$ true, as we then quantify over fewer situations.
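
A three-state sketch illustrates both points: an undecided state, and the persistence of boxed statements along a transitive relation. The state names and the relation are assumptions chosen for the illustration.

```python
# Hypothetical knowledge states with a reflexive, transitive relation R.
R = {
    "now": {"now", "proved", "refuted"},
    "proved": {"proved"},
    "refuted": {"refuted"},
}
PHI_TRUE = {"proved"}  # classical models in which phi holds

def box(holds, m):
    """Necessity: `holds` is true in every state accessible from m."""
    return all(holds(n) for n in R[m])

phi = lambda w: w in PHI_TRUE
not_phi = lambda w: w not in PHI_TRUE

def persistent(holds):
    """If box(holds) is true at m, it stays true at every n accessible from m."""
    return all(box(holds, n) for m in R if box(holds, m) for n in R[m])

print(box(phi, "now"), box(not_phi, "now"))  # neither is established yet
print(box(phi, "proved"))                    # established after the proof
print(persistent(phi), persistent(not_phi))
```

In the state "now" neither the boxed statement nor its boxed negation holds, while each of them holds in one of the two possible future states and, once true, remains true.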



1.2.8 Inheritance systems 

Inheritance systems or diagrams are directed acyclic graphs with two types of arrows, positive and negative ones. Roughly, 
nodes stand for sets of objects, like birds, penguins, etc., or properties like "able to fly". A positive arrow $a \to b$ stands for
"(almost) all $x \in a$ are also in $b$" - so it admits exceptions. A negative arrow $a \not\to b$ stands for "(almost) all $x \in a$ are not
in $b$" - so it also admits exceptions. Negation is thus very strong. The problem is to find the valid paths (concatenations
of arrows) in a given diagram, considering contradictions and specificity. See Chapter 9 (page 169) for a deeper explanation
and formal definitions.

Notation 1.3.1 

We will use for the global universal modal quantifier: jtcf) holds in a model iff (j> holds everywhere - it is the dual of 
consistency. 

$\Box$ and $\Diamond$ are the usual universal and existential modal quantifiers; recall that $\nabla$ is some normality quantifier, see e.g.
Chapter 3 (page 53).

1.3.1 Basic semantic entities, truth values, and operators 

1.3.1.1 The levels of language and semantics 

We have several levels: 

(1) the language and the truth values 

(2) the basic semantical entities, e.g. classical models, maximal consistent sets of modal formulas, etc. 

(3) abstract or algebraic semantics, which describe the interpretation of the operators of the language in set-theoretic terms,
like the interpretation of $\wedge$ by $\cap$, of $\nabla$ ("the normal cases of") by $\mu$ (choice of minimal elements), etc. These semantics do
not indicate any mechanism which generates these abstract operators.

(4) structural semantics, like Kripke structures, preferential structures, which give such mechanisms, and generate the 
abstract behaviour of the operators. They are or should be the intuitive basis of the whole enterprise. 

(By analogy, we have a structural, an abstract, and a logical limit - see Section [53] (page [101]).)

1.3.1.2 Language and truth values 

A language has 

• variable parts, like propositional variables 

• constant parts, like 

— operators, e.g. $\wedge$, $\nabla$, $\Box$, etc., and

— relations like a consequence relation |~, etc. 

Operators may have a 

• unique interpretation, like $\wedge$, which is always interpreted by $\cap$,

• or only restrictions on the interpretation, like $\nabla$, $\Box$

Operators and relations may be 

• nested, like $\wedge$, $\nabla$,

• or only flat (perhaps on the top level), like |~ 

The truth values are part of the overall framework. For the moment, we will tacitly assume that there is only TRUE and 
FALSE. We will see in (1.3.1) that this restriction is unimportant for our purposes. 

1.3.1.3 Basic semantical entities 

The language speaks about the basic semantic entities. Note that the language will usually NOT speak about the relation 
of a Kripke or preferential structure, etc., only about the resulting function, resp. the operator which is interpreted by the 
function, as part of formulas - we do not speak directly about operators, but only as part of formulas. 

For the same language, there may be different semantic entities. The semantic entities are (perhaps consistent, perhaps 
complete wrt. the logic) sets of formulae of the language. They are descriptions - in the language - of situations. They 
are NOT objects (in FOL we have names for objects), nor situations, but only descriptions, even if it may help to consider 
them as such objects, (but unreal ones, just as mannequins in shop windows are unreal - they are only there to exhibit the 
garments.) 

An example for different semantic entities is intuitionist logics, where we may take 

• knowledge states, which may be incomplete (this forces the relation in Kripke structures to be monotonic), or 

• classical models, where □ codes knowledge, and, automatically, its growth. 

(Their equivalence is a mathematical result, the former approach is perhaps the philosophically better one, the second one 
easier to handle, as intuitionistic formulas are distinct by the preceding □.) 

The entities need not contain all formulas with all operators, perhaps they are only sets of propositional variables, with 
no operators at all, perhaps they are all consistent sets of formulas of some sublanguage. For classical logic, we can take 
as basic entities either just sets of propositional variables, or maximal consistent sets of formulas. 
• Any set of formulas is a candidate for a semantic entity.

• If we want them to be complete, eliminate those which are not.

• Eliminate all which are contradictory under the operators and the logic which governs their behaviour - e.g. $p$, $q$,
and $\neg p \vee \neg q$ together cannot hold in classical models.

In this approach, the language determines all situations, the logic those which are possible. We thus have a clean distinction 
between the work of the language and that of the logic. 
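
The elimination procedure can be sketched for a tiny language. The three formulas and the brute-force consistency test over classical valuations are assumptions chosen for illustration; completeness is not enforced here.

```python
from itertools import combinations, product

ATOMS = ("p", "q")

# A tiny stock of formulas, each given by a name and a truth function.
FORMULAS = {
    "p": lambda v: v["p"],
    "q": lambda v: v["q"],
    "not p or not q": lambda v: (not v["p"]) or (not v["q"]),
}

def candidates():
    """The language proposes every set of formulas as a candidate entity."""
    names = sorted(FORMULAS)
    return [set(c) for r in range(len(names) + 1)
            for c in combinations(names, r)]

def consistent(names):
    """The logic keeps a set iff some classical valuation satisfies all of it."""
    valuations = (dict(zip(ATOMS, bits))
                  for bits in product([False, True], repeat=len(ATOMS)))
    return any(all(FORMULAS[n](v) for n in names) for v in valuations)

survivors = [s for s in candidates() if consistent(s)]
print(len(candidates()), len(survivors))
```

Of the eight candidate sets, only the one containing all three formulas is eliminated.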

For preferential reasoning (with some relation |~ outside the "core language" ), we may again take all classical models 
- however defined - and introduce the interpretation of |~ in the algebraic or abstract superstructure (see below), but 
we may also consider a normality operator $\nabla$ directly in the language, and all consistent sets of such formulas (see e.g.
[SGMR00]). Our picture is large enough to admit both possibilities.

In modal logic, we may again consider classical models, or, as is mostly done, (maximal consistent) sets of formulas of the 
full language. 

The choice of these entities is a philosophical decision, not dictated by the language, but it has some consequences. - See 
below (nested preferential operators) for details. We call these entities models or basic models. 



1.3.1.4 Several truth values in basic semantic entities 

When we consider sets of formulas as basic models, we assume implicitly two truth values: everything which is in the set, 
has truth value TRUE, the rest truth value FALSE (or undecided - context will tell). Of course, we can instead consider 
(partial) functions from the set of formulas into any set of truth values - this does not change the overall approach. E.g., 
in an information state, we might have been informed that $\phi$ holds with some reliability $r$, in another one with reliability
$r'$, etc., so $r$, $r'$ etc. may be truth values, and even pairs $\{r, r'\}$ when we have been informed with reliability $r$ that $\phi$ and
with reliability $r'$ that $\neg\phi$. Whatever is reasonable in the situation considered should be admitted as truth value.



1.3.2 Algebraic and structural semantics 

We make now a major conceptual distinction, between an "algebraic" and a "structural" semantics, which can best be 
illustrated by an example. 

Consider nonmonotonic logics as discussed above. In preferential structures, we only consider the minimal elements, say
$\mu(X)$, if $X$ is a set of models. Abstractly, we thus have a choice function $\mu$, defined on the power set of the model set, and
$\mu$ has certain properties, e.g. $\mu(X) \subseteq X$. More important is the following property: $X \subseteq Y \to \mu(Y) \cap X \subseteq \mu(X)$. (The
proof is trivial: suppose there were $x \in \mu(Y) \cap X$, $x \notin \mu(X)$. Then there must be $x' \prec x$, $x' \in X \subseteq Y$, but then $x$ cannot
be minimal in $Y$.)
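
Both properties can also be checked mechanically on a small example. The sketch below draws an arbitrary relation at random on a five-element set (the universe, the seed and the helper names are assumptions) and tests every pair of subsets; this illustrates the properties but does not replace the proof.

```python
from itertools import combinations
import random

def mu(X, prec):
    """Minimal elements of X: no x' in X with (x', x) in the relation."""
    return {x for x in X if not any((y, x) in prec for y in X)}

def subsets(U):
    """All subsets of U, as Python sets."""
    U = list(U)
    return [set(c) for r in range(len(U) + 1) for c in combinations(U, r)]

def check_properties(U, prec):
    """mu(Y) is a subset of Y, and X subset of Y implies mu(Y) & X subset of mu(X)."""
    for Y in subsets(U):
        assert mu(Y, prec) <= Y
        for X in subsets(Y):
            assert mu(Y, prec) & X <= mu(X, prec)
    return True

random.seed(0)
U = range(5)
# An arbitrary binary relation: not necessarily transitive or irreflexive.
prec = {(a, b) for a in U for b in U if random.random() < 0.3}
print(check_properties(U, prec))
```

No assumption on the relation is needed for the check to pass, in line with the remark that the relation may be arbitrary.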

Thus, all preferential structures generate $\mu$-functions with certain properties, and once we have a complete list, we can show
that any arbitrary model choice function with these properties can be generated by an appropriate preferential structure. 

Note that we do not need here the fact that we have a relation between models, just any relation on an arbitrary set 
suffices. It seems natural to call the complete list of properties of such $\mu$-functions an algebraic semantics, forgetting that
the function itself was created by a preferential structure, which is the structural semantics. 

This distinction is very helpful: it not only incites us to separate the two semantics conceptually, but also to split
completeness proofs into two parts: one part, where we show the correspondence between the logical side and the algebraic semantics,
and a second one, where we show the correspondence between the algebraic and the structural semantics. The latter part
will usually be more difficult, but any result obtained here is independent of the logic itself, and can thus often be re-used
in other logical contexts. On the other hand, there are often some subtle problems for the correspondence between the
logics and the algebraic semantics (see definability preservation, in particular the discussion in [Sch04]), which we can then
more clearly isolate, identify, and solve.



1.3.2.1 Abstract or algebraic semantics 



In all cases, we see that the structural semantics define a set operator, and thus an algebraic semantics: 

• in nonmonotonic logics (and Deontic Logic), the function chooses the minimal (morally best) models, a subset, 

• in (distance based) Theory Revision, we have a binary operator, say $\mid$, which chooses the $\phi$-models closest to the set
of $K$-models: $M(K) \mid M(\phi)$,

• in Theory Update, the operator chooses the $i$-th coordinate of all best sequences,

• in the Logic of Counterfactual Conditionals, we have again a binary operator $m \mid M(\phi)$ which chooses the $\phi$-models
closest to $m$, or, when we consider a whole set $X$ of models as starting points, $X \mid M(\phi) = \bigcup \{ m \mid M(\phi) : m \in X \}$,

• in Modal and Intuitionistic Logic, seen from some model m, we choose a subset of all the models (thus not a subset 
of a more restricted model set), those which can be reached from m. 

Thus, in each case, the structure "sends" us to another model set, and this expresses the change from the original situation 

(Note again that we have neglected here the possibility that there are no best or closest models (or sequences), but only
ever better ones.) 

Abstract semantics are interpretations of the operators of the language (all, flat, top level or not) by functions (or relations
in the case of $\mathrel{|\!\sim}$), which assign to sets of models sets of models, $O : \mathcal{P}(\mathcal{M}) \to \mathcal{P}(\mathcal{M})$ - $\mathcal{P}$ the power set operator, and $\mathcal{M}$
the set of basic models -, or binary functions for binary operators, etc.

These functions are determined or restricted by the laws for the corresponding operators. E.g., in classical, preferential, 
or modal logic, $\wedge$ is interpreted by $\cap$, etc.; in preferential logic $\nabla$ by $\mu$; in modal logic, we interpret $\Box$, etc.

Operators may be truth-functional or not. $\neg$ is truth-functional: it suffices to know the truth value of $\phi$ at some point, to
know that of $\neg\phi$ at the same point. $\Box$ is not truth-functional: $\phi$ and $\psi$ may hold, and $\Box\phi$, but not $\Box\psi$, all at the same
point (= base model); we have to look at the full picture, not only at some model.

We consider first those operators which have a unique possible interpretation, like $\wedge$, which is interpreted by $\cap$, $\neg$ by
$C$, the set theoretic complement, etc. $\nabla$ (standing for "most", "the important", etc.), e.g., has only restrictions on its
interpretation, like $\mu(X) \subseteq X$, etc. Given a set of models without additional structure, we do not know its exact form; we
know it only once we have fixed the additional structure (the relation in this case).

If the models already contain the operator, the function will respect it, i.e. we cannot have $\phi$ and $\neg\phi$ in the same model,
as $\neg$ is interpreted by $C$. Thus, the functions can, at least in some cases, control consistency.

If, e.g., the models contain $\wedge$, then we have two ways to evaluate $\phi \wedge \psi$: we can first evaluate $\phi$, then $\psi$, and use the
function for $\wedge$ to evaluate $\phi \wedge \psi$. Alternatively, we can look directly at the model for $\phi \wedge \psi$ - provided we considered the
full language in constructing the models.

As we can apply one function to the result of the other, we can evaluate complicated formulas, using the functions on the
set of models. Consequently, if $\mid\sim$ or $\nabla$ is evaluated by $\mu$, we can consider $\mu(\mu(X))$ etc., thus, the machinery for the flat
case gives immediately an interpretation for nested formulas, too - whether we looked for it, or not.
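
The following is a minimal sketch of this evaluation by set functions, not the authors' construction. It assumes a two-variable language and a hypothetical preference relation (fewer true variables counts as more normal); the names `atom`, `conj`, `neg`, `smaller` and `mu` are illustrative only. It shows conjunction as intersection, negation as complement, and nesting as plain function composition.

```python
from itertools import product

VARS = ("p", "q")
# A basic model is a tuple of truth values, one entry per variable in VARS.
MODELS = frozenset(product((0, 1), repeat=len(VARS)))

def atom(name):
    """Model set of a propositional variable."""
    i = VARS.index(name)
    return frozenset(m for m in MODELS if m[i] == 1)

def conj(X, Y):
    """Conjunction is interpreted by intersection."""
    return X & Y

def neg(X):
    """Negation is interpreted by set-theoretic complement."""
    return MODELS - X

def smaller(m, n):
    # Hypothetical preference relation: fewer true variables is "more normal".
    return sum(m) < sum(n)

def mu(X):
    """Minimal elements of X under the assumed preference relation."""
    return frozenset(m for m in X if not any(smaller(n, m) for n in X))

p, q = atom("p"), atom("q")
not_both = neg(conj(p, q))       # models of "not (p and q)"
flat = mu(not_both)              # one application of mu
nested = mu(mu(not_both))        # nesting is just function composition
contradiction = conj(p, neg(p))  # phi and not-phi share no model
```

With this particular relation the nested and the flat result coincide; the point is only that no extra machinery is needed to evaluate the nested case.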

As far as we see, our picture covers the usual presentations of classical logic, preferential, intuitionistic, and modal logic, but
also of linear logic (where we have more structure on the set of basic models, a monoid, with a distinct set $\bot$, plus some
topology for $!$ and $?$ - see below), and quantum logic à la Birkhoff/von Neumann.

We can introduce new truth-functional operators into the language as follows: Suppose we have a distinct truth value
TRUE, then we may define $O_X(\phi) = TRUE$ iff the truth value of $\phi$ is an element of $X$. This might sometimes be helpful.
Making the truth value explicit as element of the object language may facilitate the construction of an accompanying proof
system - experience will tell whether this is the case. In this view, $\neg$ has now a double meaning in the classical situation:
it is an operator for the truth value "false", and an operator on the model set, and corresponds to the complement. "Is
true" is the identical truth-functional operator, is-true($\phi$) and $\phi$ have the same truth value.

If the operators have a unique interpretation, this might be all there is to say in this abstract framework. (This does not
mean that it is impossible to introduce new operators which are independent from any additional structure, and based only
on the set of models for the basic language. We can, for instance, introduce a "CON" operator, saying that $\phi$ is consistent,
and $CON(\phi)$ will hold everywhere iff $\phi$ is consistent, i.e. holds in at least one model. Or, for a more bizarre example,
a "3" operator, which says that $\phi$ has at least 3 models (which is then dependent on the language). We can also provide
exactly one additional structure, e.g. in the following way: Introduce a ranked order between models as follows: At the
bottom, put the single model which makes all propositional variables true, on the next level those which make exactly one
propositional variable false, then two, etc., with the model making all false on top. So there is room to play; whether one can
find many useful examples is another question.)
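
As an illustration only, the operators just mentioned can be sketched over a three-variable language. The function names `con`, `at_least_three`, `rank` and `mu` are hypothetical, and the ranked order is read here as "level = number of variables made false", with the all-true model at the bottom.

```python
from itertools import product

VARS = ("p", "q", "r")
MODELS = frozenset(product((0, 1), repeat=len(VARS)))

def con(X):
    """CON(phi): holds everywhere iff phi has at least one model."""
    return MODELS if X else frozenset()

def at_least_three(X):
    """The '3' operator: holds everywhere iff phi has at least 3 models."""
    return MODELS if len(X) >= 3 else frozenset()

def rank(m):
    # Level in the ranked order: number of variables the model makes false.
    return len(VARS) - sum(m)

def mu(X):
    """Elements of X on the lowest occupied level of the ranked order."""
    if not X:
        return frozenset()
    lowest = min(rank(m) for m in X)
    return frozenset(m for m in X if rank(m) == lowest)

p = frozenset(m for m in MODELS if m[0] == 1)
not_p = MODELS - p
```

Note that `at_least_three(p)` depends on the language: with three variables p has four models, with a single variable it would have only one.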

If the operator has no unique interpretation (like $\nabla$, $\Box$, etc., which are only restricted, e.g. by $(\phi \to \psi) \to (\nabla\psi \wedge \phi \to \nabla\phi)$),
the situation seems more complicated, and is discussed below in Section 1.3.3.

It is sometimes useful to consider the abstract semantics as a (somehow coherent) system of filters. For instance, in
preferential structures, $\mu(X) \subseteq X$ can be seen as the basis of a principal filter. Thus, $\phi \mid\sim \psi$ iff $\psi$ holds in all minimal
models of $\phi$, iff there is a "big" subset of $M(\phi)$ where $\psi$ holds, recalling that a filter is an abstraction of size - sets in
the filter are big, their complements small, and the other sets have medium size. Thus, the "normal" elements form the
smallest big subset. Rules like $X \subseteq Y \to \mu(Y) \cap X \subseteq \mu(X)$ form the coherence between the individual filters, we cannot
choose them totally independently. Particularly for preferential structures, the reasoning with small and big subsets can be
made very precise and intuitively appealing, and we will come back to this point later. We can also introduce a generalized
quantifier, say $\nabla$, with the same meaning, i.e. $\phi \mid\sim \psi$ iff $\nabla(\phi).\psi$, i.e. "almost everywhere", or "in the important cases"
where $\phi$ holds, so will $\psi$. This is then the syntactic analogue of the semantical filter system. These aspects are discussed
in detail in a later chapter.
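
A small sketch of the filter reading, under stated assumptions: the three elements and their ranks are invented for illustration, and `is_big`, `is_small`, `size` and `entails` are hypothetical helper names. A subset of X counts as big iff it contains mu(X), small iff its complement in X is big, and medium otherwise.

```python
RANK = {"a": 0, "b": 0, "c": 1}  # hypothetical ranks; lower means more normal

def mu(X):
    """Minimal (most normal) elements of X."""
    if not X:
        return frozenset()
    best = min(RANK[x] for x in X)
    return frozenset(x for x in X if RANK[x] == best)

def is_big(A, X):
    """A lies in the principal filter over X generated by mu(X)."""
    return mu(X) <= A <= X

def is_small(A, X):
    """A is small iff its complement in X is big."""
    return A <= X and is_big(X - A, X)

def size(A, X):
    if is_big(A, X):
        return "big"
    if is_small(A, X):
        return "small"
    return "medium"

def entails(M_phi, M_psi):
    """phi |~ psi iff psi holds on a big subset of M(phi)."""
    return is_big(M_phi & M_psi, M_phi)

X = frozenset("abc")
```

Here `mu(X)` is the smallest big subset, and a set such as `{"a"}` that splits the normal elements is neither big nor small.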



1.3.2.2 Structural semantics 

Structural semantics generate the abstract or algebraic semantics, i.e. the behaviour of the functions or relations (and of
the operators in the language when we work with "rich" basic models). Preferences between models generate corresponding
$\mu$-functions, relations in Kripke structures generate the functions corresponding to $\Box$-operators, etc.

Ideally, structural semantics capture the essence of what we want to reason and speak about (beyond classical logic), they 
come, or should come, first. Next, we try to see the fundamental ingredients and laws of such structures, code them in an 
algebraic semantics and the language, i.e. extract the functions and operators, and their laws. In a backward movement, we 
make the roles of the operators (or relations) precise (should they be nested or not?, etc.), and define the basic models and 
the algebraic operators. This may result in minor modifications of the structural semantics (like introduction of copies), 
but should still be close to the point of outset. In this view, the construction of a logic is a back- and- forth movement. 



1.3.3 Restricted operators and relations 

We discuss only operators; relations seem to be similar. The discussion applies as well to abstract as to structural semantics.
Thus, the interpretation will be more definite than the operator. It seems that the problem has no universal solution.

(1) If there is tacitly a "best choice", it seems natural to make this choice. At the same time, such a best choice may also 
serve to code our ignorance, without enumerating all possible cases among which we do not know how to decide. 

For instance, in reasoning about normality (preferential structures), the interpretation which makes nothing more normal 
than explicitly required - corresponding to a set-theoretically minimal relation - seems a natural choice. This will NOT 
always give the same result as a disjunction over all possible interpretations: e.g., if the operator is in the language, and 
we have finitely many possibilities, we can express them by "or" , and this need not be the same as considering the unique 
minimal solution. (Note that this will usually force us to consider "copies" in preferential structures - see below.) 

(2) We can take all possible interpretations, and consider them separately, and take as result only those facts which hold in
all possibilities. Bookkeeping seems difficult, especially when we have nested operators, which all have to be interpreted in
the various ways. In a second step, we can unite all possibilities in one grand picture (a universal structure, as it permits
to find exactly all consequences in one construction, and not in several ones as is done for classical logic), essentially by a
disjoint union - this was done (more or less) in the authors' [SGMRT00] for preferential structures.

(3) We can work with basic models already in the full language, and capture the different possibilities already on the basic
level. The interpretation of the operator will then be on the big set of models for the full language, which serve essentially
as bookkeeping device for the different possibilities of interpretation - again a universal structure. This is done in the
usual completeness proofs for modal logic.



1.3.4 Copies in preferential models 

Copies in preferential structures (variant (2) in Section 1.3.3) thus seem to serve to construct universal structures,
or code our ignorance, i.e. we know that x is minimized by X, but we do not know by which element of X; in this
view, they are artificial. But they have an intuitive justification, too: They allow minimization by sets of other elements only. We
may consider an element m abnormal only in the presence of several other elements together. E.g., considering penguins,
nightingales, woodpeckers, ravens, they all have some exceptional qualities, so we may perhaps not consider a penguin
more abnormal than a woodpecker, etc., but seeing all these birds together, the penguin stands out as the most abnormal
one. But we cannot code minimization by a set, without minimization by its elements, without the use of copies. Copies
will then code the different aspects of abnormality.



1.3.5 Further remarks on universality of representation proofs 

There is a fundamental difference between considering all possible ramifications and coding ignorance. For instance, if we
know that $\{a, b, c\}$ is minimized by $\{b, c\}$, we can create two structures, one where a is minimized by b, the other where
a is minimized by c. These are all possible ramifications (if $\mu(\{a\}) \neq \emptyset$). Or, we can code our ignorance with copies of
a, as is done in our completeness constructions. $((\{a, b\} \mid\sim b)$ or $(\{a, c\} \mid\sim c))$ is different from $\{a, b, c\} \mid\sim \{b, c\}$, and if
the language is sufficient, we can express this. In a "directly ignorant" structure, neither of the two disjuncts holds, so the
disjunction will fail.
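
The difference can be made concrete in a small sketch. The encoding is an assumption of this example, not the authors' completeness construction: a copy is a pair (element, index), `below` lists pairs (lower copy, higher copy), and an element is minimal in X iff at least one of its copies has no copy of an element of X below it.

```python
def mu(X, copies, below):
    """Minimal elements of X in a preferential structure with copies."""
    def attacked(copy):
        # A copy is attacked if some copy of an element of X lies below it.
        return any(lo[0] in X for lo, hi in below if hi == copy)
    return frozenset(
        x for x in X
        if any(not attacked((x, i)) for i in range(copies[x]))
    )

def entails(X, Y, copies, below):
    """X |~ Y iff all minimal elements of X lie in Y."""
    return mu(X, copies, below) <= Y

# "Directly ignorant" structure: two copies of a, one attacked by b, one by c.
ignorant = ({"a": 2, "b": 1, "c": 1},
            {(("b", 0), ("a", 0)), (("c", 0), ("a", 1))})

# One of the two ramifications: a single copy of a, minimized by b.
ramification_b = ({"a": 1, "b": 1, "c": 1},
                  {(("b", 0), ("a", 0))})

abc = frozenset("abc")
ab = frozenset("ab")
ac = frozenset("ac")
```

In the ignorant structure a is minimized only by b and c together, so `{a, b, c} |~ {b, c}` holds while neither disjunct does; in the ramification the first disjunct holds.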

Our proofs try to express ignorance directly. Note that a representing structure can be optimal in several ways: (1) 
optimal, or universal, as it expresses exactly the logic, (2) optimal, as it has exactly the required properties, but not more. 
For instance, a smooth structure can be optimal, as it expresses exactly the logic it is supposed to code, or optimal, as 
it preserves exactly smoothness, there is no room to move left. Usually, one seeks only the first variant. If both variants 
disagree, then structure and model do not coincide exactly, the structure still has more space to move than necessary for 
representation. 

In our constructions, all possibilities are coded into the choice functions (the indices), but as they are not directly visible 
to the language, they are only seen together, so there is no way to analyze the "or" of the different possibilities separately. 



1.3.6 |~ in the object language? 

It is tempting to try and put a consequence relation $\mid\sim$ into the object language by creating a new modal operator $\nabla$
(expressing "most" or so), with the translation $\phi \mid\sim \psi$ iff $\vdash \nabla\phi \to \psi$.

We examine now this possibility and the resulting consequences.

We suppose that $\to$ will be interpreted in the usual way, i.e. by the subset relation. The aim is then to define the
interpretation of $\nabla$ s.t. $\phi \mid\sim \psi$ iff $M(\nabla\phi) \subseteq M(\psi)$ - $M(\phi)$ the set of models of $\phi$.

It need not be the case - but it will be desirable - that $\nabla$ is insensitive to logical equivalence. $\nabla\phi$ and $\nabla\phi'$ may well be
interpreted by different model sets, even if $\phi$ and $\phi'$ are interpreted by the same model sets. Thus, we need not have left logical
equivalence - whatever the basic logic is. On the right hand side, as we define $\mid\sim$ via $\to$, and $\to$ via model subsets, we will
have closure under semantic consequence (even infinite closure, if this makes a difference). It seems obvious that this is
the only property a relation $\mid\sim$ has to have to be translatable into object language via "classical" $\to$ - or, better, subsets of
models.

Note that the interpretation of $\nabla\phi$ can be equivalent to a formula, a theory, or to a set of models logically equivalent to a
formula or theory, even if some models are missing (lack of definability preservation). Likewise, we may also consider $\nabla T$
for a full theory $T$. The standard solution will, of course, be $M(\nabla\phi) := \bigcap\{M(\psi) : \phi \mid\sim \psi\}$.
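
The standard solution can be checked mechanically in a finite setting. The sketch below assumes a two-variable language, so that every set of models is definable by a formula and definability preservation is not an issue, and it assumes a hypothetical preference relation (fewer true variables is more normal). The names `consequences` and `nabla` are illustrative.

```python
from itertools import combinations, product

MODELS = frozenset(product((0, 1), repeat=2))

def subsets(U):
    items = sorted(U)
    return [frozenset(c) for r in range(len(items) + 1)
            for c in combinations(items, r)]

def mu(X):
    """Minimal elements of X; assumed relation: fewer true variables wins."""
    return frozenset(m for m in X if not any(sum(n) < sum(m) for n in X))

def consequences(X):
    """All model sets M(psi) with phi |~ psi, where X = M(phi)."""
    return [A for A in subsets(MODELS) if mu(X) <= A]

def nabla(X):
    """M(nabla phi) as the intersection of all M(psi) with phi |~ psi."""
    result = MODELS
    for A in consequences(X):
        result &= A
    return result
```

In this finite case the intersection recovers exactly the minimal models, and the set of consequences is closed under supersets, which is the closure under semantic consequence noted in the text.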



1.3.6.1 Possibilities and problems of external |~ vs. internal V 
• The translation of |~ into the object language - whenever possible - introduces contraposition into the logic, see also
[Sch04] for a discussion.

• It may also help to clarify questions like validity of the deduction theorem (we see immediately that one half is
monotony), cut, situations like $\nabla\phi \to \psi$ and $\nabla\psi \to \phi$, etc.

• It is more usual to consider full theories outside the language, even if, a priori, there is no problem to define a formula 
VT using a theory T in the inductive step. Thus, the external variant can be more expressive in one respect. 

• At least in principle, having $\nabla$ inside the language makes it more amenable to relativize it to different viewpoints,
as in modal logic. It seems at least more usual to write $m \models \nabla\phi \to \psi$ than to say that in $m$, $\phi \mid\sim \psi$ holds.



1.3.6.2 Disjunction in preferential and modal structures 

In a modal Kripke model, $\Box\phi \vee \Box\psi$ may hold everywhere, but neither $\Box\phi$ nor $\Box\psi$ may hold everywhere. This is possible,
as formulas are evaluated locally, and at one point $m$, we may make $\Box\phi$ hold, at another $m'$ $\Box\psi$.

This is not the case for the globally evaluated modal operator $\blacksquare$. Then, in one structure, if $\blacksquare(\phi) \vee \blacksquare(\psi)$ holds, either $\blacksquare(\phi)$
or $\blacksquare(\psi)$ holds, but a structure where $\blacksquare(\phi)$ (or $\blacksquare(\psi)$) holds, has more information than a structure in which $\blacksquare(\phi) \vee \blacksquare(\psi)$
holds. Consequently, one Kripke structure is not universal any more for $\blacksquare$ (and $\vee$), we need several such structures to
represent disjunctions of $\blacksquare$.

The same is the case for preferential structures. If we put $\mid\sim$ into the language as $\nabla$, to represent disjunctions, we may
need several preferential structures: $(\nabla\phi \to \psi) \vee (\nabla\phi \to \psi')$ will only be representable as $\nabla\phi \to \psi$ or as $\nabla\phi \to \psi'$,
but in one structure, only one of them may hold. Thus, again, one structure will say more than the original formula
$(\nabla\phi \to \psi) \vee (\nabla\phi \to \psi')$, and to express this formula faithfully, we will need again two structures. Thus, putting $\mid\sim$ into
the object language destroys the universality of preferential structures by its richer language.

(Remark: rational monotony is, semantically, also universally quantified: $\alpha \mid\sim \gamma \to \alpha \wedge \beta \mid\sim \gamma$, or everywhere $\alpha \mid\sim \neg\beta$.)



1.3.6.3 Iterated V in preferential structures 

Once we have $\nabla$ in the object language, we can form $\nabla\nabla\phi$ etc.

When we consider preferential structures in the standard way, it is obvious that $\nabla\nabla\phi = \nabla\phi$, etc. will hold.

But, even in a preferential context, it is not obvious that this has to hold, it suffices to interpret the relation slightly
differently. Instead of setting $\mu(X)$ the set of minimal elements of $X$, it suffices to define $\mu(X)$ the set of non-worst
elements of $X$, i.e. everything except the upper layer. (One of the authors once had a similar discussion with M. Magidor.)
(Note that, in this interpretation, for instance $\nabla\nabla\phi$ may seem to be the same as $\nabla\phi$, but $\nabla\nabla\nabla\phi$ clearly is different from
$\nabla\nabla\phi$: In going from $\nabla\phi$ to $\nabla\nabla\phi$, we lose some models, but not enough to be visible by logics - a problem of definability
preservation. The loss becomes visible in the next step.)
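
A minimal sketch of the two readings of $\mu$ on a ranked set, with invented worlds and ranks. The function `non_worst` is a literal reading of "everything except the upper layer" and is an assumption of this example; in particular it returns the empty set once only one layer is left.

```python
RANK = {"w0": 0, "w1": 1, "w2": 2, "w3": 3}  # hypothetical ranked worlds

def minimal(X):
    """Standard reading: keep the best (lowest) layer only."""
    if not X:
        return frozenset()
    best = min(RANK[x] for x in X)
    return frozenset(x for x in X if RANK[x] == best)

def non_worst(X):
    """Alternative reading: drop only the worst (top) layer."""
    if not X:
        return frozenset()
    worst = max(RANK[x] for x in X)
    return frozenset(x for x in X if RANK[x] < worst)

def iterate(f, X, n):
    """Apply f n times, i.e. evaluate n nested nablas by the same function."""
    for _ in range(n):
        X = f(X)
    return X

X = frozenset(RANK)
```

Under `minimal`, iteration collapses after one step; under `non_worst`, each further application removes one more layer, so iterated $\nabla$ does not collapse.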

But, we can have a similar result even in the usual interpretation of "normal" by "best": 

Consider for any $X \subseteq \omega$ the logic $\mid\sim_X$ defined by the axioms $\{A_i : i \in X\}$, where $A_i := \nabla^{i+1}\phi \leftrightarrow \nabla^i\phi$, and $\nabla^i\phi$ is, of
course, $i$ many $\nabla$'s, followed by $\phi$, etc., plus the usual axioms for preferential logics. This defines $2^\omega$ many logics, and we
show that they are all different. The semantics we will give show at the same time that the usual axioms for preferential
logics do not entail $\nabla\nabla\phi \leftrightarrow \nabla\phi$.

For simplicity, we first show that the $\omega$ many logics defined by the axioms $B_i := \nabla^{i+1}\phi \leftrightarrow \nabla^i\phi$ for arbitrary $\phi$ are different.
We consider sequences of $\omega$ many preferential structures over some infinite language, s.t. we choose exactly at place $i$ the
same structure twice, and all the other times different structures. Let $S_i$ be the structure which minimizes $\neg p_i$ to $p_i$, i.e.
every $p_i$-model $m$ is smaller than its opposite $m'$, which is like $m$, only $m' \models \neg p_i$. It is obvious that the associated
$\mu$-functions $\mu_i$ will give different results on many sets $X$ (if a model $x \in X$ is minimized at all in $X$, it will be minimized
in different ways). Consider now e.g. a formula $\nabla\nabla\phi$. We start to evaluate at $S_0$, evaluating the leftmost $\nabla$ by $\mu_0$. The
second $\nabla$ will be evaluated at $S_1$, by $\mu_1$. If, for instance, $\phi$ is a tautology, we eliminate in the first step the $\neg p_0$-models, in
the second step the $\neg p_1$-models, so $\nabla\nabla$ is not equivalent to $\nabla$. If, instead of taking at position $i+1$ $S_{i+1}$, we just take
$S_i$ again, then axiom $B_i$ will hold, but not the other $B_j$, $i \neq j$. Thus, the logics $B_i$ are really different.

In the first case, for the $A_i$, we repeat $S_j$ for all $j \in X$, instead of taking $S_{j+1}$, and start evaluation again at $S_0$. Again,
the different sequences of structures will distinguish the different logics, and we are done.
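
The construction can be tried out on a finite fragment. This is a sketch under assumptions: three variables p0, p1, p2 stand in for the infinite language, a sequence of structures is given as a list of indices, and the bookkeeping of which nesting depth uses which position of the sequence is a convention of this example (`nabla_power` and the two sample sequences are hypothetical names).

```python
from itertools import combinations, product

N = 3  # hypothetical: p0, p1, p2 stand in for the infinite language
MODELS = frozenset(product((0, 1), repeat=N))
SUBSETS = [frozenset(c) for r in range(len(MODELS) + 1)
           for c in combinations(sorted(MODELS), r)]

def mu(i, X):
    """Minimal elements of X in S_i: a (not p_i)-model loses to its p_i-twin."""
    def twin(m):
        return m[:i] + (1,) + m[i + 1:]
    return frozenset(m for m in X if m[i] == 1 or twin(m) not in X)

def nabla_power(X, depth, seq):
    """Model set of depth-many nablas; the leftmost nabla uses seq[0]."""
    for i in reversed(seq[:depth]):  # the innermost nabla is applied first
        X = mu(i, X)
    return X

all_different = [0, 1, 2]  # S_0, S_1, S_2
repeat_first = [0, 0, 1]   # S_0 twice, then S_1
```

With `all_different`, two nablas applied to a tautology give fewer models than one. With `repeat_first`, two nablas always collapse to one, while three nablas still differ from two, so the collapse axioms are separated by the choice of sequence.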

Yet the standard interpretation of $\nabla$ in one single preferential structure does not allow us to distinguish between all these
logics, as sequences of $\nabla$'s will always be collapsed to one $\nabla$. As long as a (preferential) consequence relation satisfies this,
it will hold in the standard interpretation, if not, it will fail.

This is another example which shows that putting $\mid\sim$ as $\nabla$ in the object language can substantially increase the
expressiveness of the logics.

Compare the situation to modal logic and Kripke semantics. In Kripke semantics, any evaluation of $\Box$ takes us to different
points in the structure, and, if the relation is not transitive, what is beyond the first step is not visible from the origin. In
preferential structures, this "hiding" is not possible, by the very definition of $\mu$, we have to follow any chain as far as it
goes.

(One of the authors had this idea when looking at the very interesting article "Mathematical modal logic: a view of its 

1.3.7 Various considerations on abstract semantics

We can think of preferential structures as elements attacking each other: if x -< y, then x attacks y, so y cannot be minimal 
any more. Thus the non-monotonicity. 

In topology e.g., things are different. Consider the open interval (0,1), and its closure [0,1]. Sequences converging to
0 or 1 "defend" 0 or 1 - thus, the more elements there are, the bigger the closure will be: monotony. The open core has a
similar property. The same is true for e.g. Kripke structures for modal logic: If y is in relation R with x, then any time y
is there, x can be reached: y defends x.

A neuron may have inhibitory (attackers) and excitatory (defending) inputs. In preferential models, we may need many 
attackers to destroy minimality of one model, provided this model occurs in several copies. 

Of course, in a neuron system, there are usually many attackers and many defenders, so we have here a rather complicated 
system. 

Abstractly, both defense and attack are combined in Gabbay's reactive diagrams, see [Gab04].
Now, back to our 

1.3.7.1 Set functions 

We can make a list of possible formal properties of such functions, which might include (U is the universe or base set): 

$f(X) = X$, $f(X) \subseteq X$, $X \subseteq f(X)$

$f(X) = f(f(X))$, $f(X) \subseteq f(f(X))$, $f(f(X)) \subseteq f(X)$

$X \neq \emptyset \to f(X) \neq \emptyset$

$X \neq U \to f(X) \cap X = \emptyset$
$X \neq \emptyset \to f(X) \cap X \neq \emptyset$

$f(C(X)) = C(f(X))$
$f(X \cup Y) = f(X) \cup f(Y)$

$f(X \cup Y) = f(X)$ or $f(Y)$ or $f(X) \cup f(Y)$ (true in ranked structures)
$X \subseteq Y \to f(Y) \cap X \subseteq f(X)$ (attack, basic law of preferential structures)
$X \subseteq Y \to f(X) \subseteq f(Y)$ (defense, e.g. Modal Logic)

$f(X) \subseteq Y \subseteq X \to f(X) = f(Y)$ (smoothness), holds also for the open core of a set in topology
$X \subseteq Y \subseteq f(X) \to f(X) = f(Y)$ counterpart, holds for topological closure

(Note that the last two properties will also hold in all other situations where one chooses the biggest subset or smallest
superset from a set of candidates.)

$f(\bigcup \mathcal{X}) = \bigcup\{f(X) : X \in \mathcal{X}\}$
etc.
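
Over a small finite universe such properties can be tested by brute force. The sketch below is illustrative only: the universe, the function `mu` (least element, standing in for a preferential choice function) and the function `successors` (a Kripke-style image under the hypothetical relation x R x+1) are assumptions of this example.

```python
from itertools import combinations

U = frozenset(range(4))  # hypothetical universe
SUBSETS = [frozenset(c) for r in range(len(U) + 1)
           for c in combinations(sorted(U), r)]

def attack_law(f):
    """X subset of Y implies f(Y) & X subset of f(X)."""
    return all(f(Y) & X <= f(X) for X in SUBSETS for Y in SUBSETS if X <= Y)

def defense_law(f):
    """X subset of Y implies f(X) subset of f(Y) (monotony)."""
    return all(f(X) <= f(Y) for X in SUBSETS for Y in SUBSETS if X <= Y)

def smoothness_law(f):
    """f(X) subset of Y subset of X implies f(X) = f(Y)."""
    return all(f(X) == f(Y) for X in SUBSETS for Y in SUBSETS if f(X) <= Y <= X)

def mu(X):
    """Preferential choice: the least element of X is the only minimal one."""
    return frozenset({min(X)}) if X else frozenset()

def successors(X):
    """Kripke-style image: everything reachable in one step of x R x+1."""
    return frozenset(x + 1 for x in X if x + 1 in U)
```

As expected, `mu` satisfies the attack and smoothness laws but not the defense law, while `successors` is monotone and violates the attack law.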

General, distance based revision is a two argument set function (in the AGM approach, it has only one argument):
$M(K) \mid M(\phi) \subseteq M(\phi)$

This is non-monotonic in both arguments, as the result depends more on the "shape" of both model sets than their size.
Counterfactuals (and update) are also two argument set functions:

$M(\phi) \uparrow M(\psi)$ is defined as the set of $\psi$-models closest to some individual $\phi$-model; here the function is monotonic in the
first argument, and non-monotonic in the second - we collect for all $m \in M(\phi)$ the closest $\psi$-models.
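
Both two-argument functions can be sketched with a concrete distance. Hamming distance between valuations is an assumption made here for illustration (the text does not fix a distance), and `revise` and `update` are hypothetical names for the two operations.

```python
def dist(m, n):
    """Hamming distance between two valuations (an assumed choice of distance)."""
    return sum(x != y for x, y in zip(m, n))

def revise(K, Phi):
    """Phi-models at globally minimal distance from the set K."""
    if not K or not Phi:
        return frozenset()
    d = min(dist(k, f) for k in K for f in Phi)
    return frozenset(f for f in Phi if any(dist(k, f) == d for k in K))

def update(Phi, Psi):
    """For each single Phi-model, collect its closest Psi-models."""
    result = set()
    for m in Phi:
        if Psi:
            d = min(dist(m, n) for n in Psi)
            result |= {n for n in Psi if dist(m, n) == d}
    return frozenset(result)

K_small = frozenset({(0, 0)})
K_large = frozenset({(0, 0), (1, 1)})
target = frozenset({(0, 1), (1, 1)})
```

Enlarging the first argument from `K_small` to `K_large` changes the revision result from `{(0, 1)}` to `{(1, 1)}`, so revision is not monotone there, whereas the update result can only grow.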

1.3.8 A comparison with Reiter defaults 

The meaning of Reiter defaults differs from that of preferential structures in a number of aspects. 

(1) The simple (Reiter) default "normally $\phi$" does not only mean that in normal cases $\phi$ holds, but also that, if $\psi$ holds,
then normally also $\phi \wedge \psi$ holds. It thus inherits "normally $\phi$" down on subsets.

(2) Of course, this is itself a default rule, as we might have that for $\psi$-cases, normally $\neg\phi$ holds. But this is a meta-default.

(3) Defaults can also be concatenated: if normally $\phi$ holds, and normally, if $\phi$ holds, then also $\psi$ holds, we conclude that
normally $\phi \wedge \psi$ holds. Again, this is a default rule.

Thus, Reiter defaults give us (at least) three levels of certainty: classical information, the information directly expressed 
by defaults, and the information concluded by the usual treatment of defaults. 



1.4 IBRS 



1.4.1 Definition and comments 
(1) An information bearing binary relation frame IBR has the form $(S, \Re)$, where $S$ is a non-empty set and $\Re$ is a subset
of $S_\omega$, where $S_\omega$ is defined by induction as follows:

(1.1) $S_0 := S$

(1.2) $S_{n+1} := S_n \cup (S_n \times S_n)$.

(1.3) $S_\omega = \bigcup\{S_n : n \in \omega\}$

We call elements from $S$ points or nodes, and elements from $\Re$ arrows. Given $(S, \Re)$, we also set $P((S, \Re)) := S$, and
$A((S, \Re)) := \Re$.

If $a$ is an arrow, the origin and destination of $a$ are defined as usual, and we write $a : x \to y$ when $x$ is the origin,
and $y$ the destination of the arrow $a$. We also write $o(a)$ and $d(a)$ for the origin and destination of $a$.

(2) Let $Q$ be a set of atoms, and $L$ be a set of labels (usually $\{0, 1\}$ or $[0, 1]$). An information assignment $h$ on $(S, \Re)$ is
a function $h : Q \times \Re \to L$.

(3) An information bearing system IBRS has the form $(S, \Re, h, Q, L)$, where $S$, $\Re$, $h$, $Q$, $L$ are as above.
See Diagram 1.4.1 for an illustration.



[Diagram 1.4.1: A simple example of an information bearing system. The nodes carry labels of the form (p, q).]



We have here: 

$S = \{a, b, c, d, e\}$.

$\Re = S \cup \{(a, b), (a, c), (d, c), (d, e)\} \cup \{((a, b), (d, c)), (d, (a, c))\}$.

$Q = \{p, q\}$.

The values of $h$ for $p$ and $q$ are as indicated in the figure. For example $h(p, (d, (a, c))) = 1$.
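
The example can be written down as a small data structure. This is a sketch, not a prescribed encoding: nodes are represented as strings and arrows as nested tuples (origin, destination), `level` computes the sets $S_n$ of the inductive definition, and only the one value of $h$ stated explicitly in the text is recorded.

```python
S = frozenset("abcde")

# Arrows as (origin, destination) pairs; origin and destination may be arrows.
ARROWS = frozenset({
    ("a", "b"), ("a", "c"), ("d", "c"), ("d", "e"),
    (("a", "b"), ("d", "c")),  # arrow from the arc (a, b) to the arc (d, c)
    ("d", ("a", "c")),         # arrow from the node d to the arc (a, c)
})
R = S | ARROWS

def level(base, n):
    """S_n of the inductive definition: S_0 = S, S_{n+1} = S_n + (S_n x S_n)."""
    current = set(base)
    for _ in range(n):
        current |= {(x, y) for x in current for y in current}
    return current

def origin(arrow):
    return arrow[0]

def destination(arrow):
    return arrow[1]

# Partial information assignment: only the value given explicitly in the text.
H = {("p", ("d", ("a", "c"))): 1}
```

All elements of the relation already occur in $S_2$, but not in $S_1$, since two of the arrows start from or point to other arrows.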



Comment 1.4.1 

The elements in Diagram 1.4.1 can be interpreted in many ways, depending on the area of application.



(1) The points in S can be interpreted as possible worlds, or as nodes in an argumentation network or nodes in a neural 
net or states, etc. 

(2) The direct arrows from nodes to nodes can be interpreted as an accessibility relation, attack or support arrows in an
argumentation network, connections in a neural net, a preferential ordering in a nonmonotonic model, etc.

(3) The labels on the nodes and arrows can be interpreted as fuzzy values in the accessibility relation or weights in the 
(4) The double arrows can be interpreted as feedback loops to nodes or to connections, or as reactive links changing the
system which are activated as we pass between the nodes.



Thus, IBRS can be used as a source of information for various logics based on the atoms in Q. We now illustrate by listing 
several such logics. 



Modal Logic 

One can consider the figure as giving rise to two modal logic models, one with actual world a and one with d, these being
the two minimal points of the relation. Consider a language with $\Box q$. How do we evaluate $a \models \Box q$?

The modal logic will have to give an algorithm for calculating the values.

Say we choose algorithm $A_1$ for $a \models \Box q$, namely:

[ $A_1(a, \Box q) = 1$ ] iff for all $x \in S$ such that $a = x$ or $(a, x) \in \Re$ we have $h(q, x) = 1$.

According to $A_1$ we get that $\Box q$ is false at a. $A_1$ gives rise to a T-modal logic. Note that the reflexivity is not anchored
at the relation $\Re$ of the network but in the algorithm $A_1$, in the way we evaluate. We say $(S, \Re, \ldots) \models \Box q$ iff $\Box q$ holds in
all minimal points of $(S, \Re)$.

For orderings without minimal points we may choose a subset of distinguished points. 
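
Algorithm $A_1$ can be sketched as follows. The node labels are an assumption: the text fixes only that d and e satisfy p and that h(q, d) = 0, and the remaining (p, q) values below are read off the partly legible diagram, so they are hypothetical. Only node-to-node arrows are used.

```python
# (p, q) labels per node; values for a, b, c, e are assumptions (see lead-in).
LABELS = {"a": (0, 0), "b": (0, 1), "c": (0, 1), "d": (1, 0), "e": (1, 1)}
NODE_ARROWS = {("a", "b"), ("a", "c"), ("d", "c"), ("d", "e")}
S = set(LABELS)

def h(atom, x):
    """Label of atom 'p' or 'q' at node x."""
    return LABELS[x]["pq".index(atom)]

def A1_box(x, atom):
    """A_1: box(atom) is 1 at x iff atom is 1 at x itself and at all successors."""
    return int(all(h(atom, y) == 1
                   for y in S if y == x or (x, y) in NODE_ARROWS))

# Minimal points: nodes that are not the destination of any node-to-node arrow.
minimal_points = {y for y in S if not any(dest == y for _, dest in NODE_ARROWS)}

# Global evaluation: box q must hold at all minimal points.
box_q_holds = all(A1_box(m, "q") == 1 for m in minimal_points)
```

The clause `y == x` is where reflexivity enters: it is part of the algorithm, not of the arrow set.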

Nonmonotonic Deduction

We can ask whether $p \mid\sim q$ according to algorithm $A_2$ defined below. $A_2$ says that $p \mid\sim q$ holds iff $q$ holds in all minimal
models of $p$. Let us check the value of $A_2$ in this case:

Let $S_p = \{s \in S \mid h(p, s) = 1\}$. Thus $S_p = \{d, e\}$.

The minimal points of $S_p$ are $\{d\}$. Since $h(q, d) = 0$, we have that $p \not\mid\sim q$.
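
The same check can be sketched in code. As before, the (p, q) labels of a, b, c and e are assumptions read off the partly legible diagram; the text itself confirms only that d and e satisfy p and that h(q, d) = 0. An arrow (x, y) is read as "x is below y".

```python
# (p, q) labels per node; values for a, b, c, e are assumptions (see lead-in).
LABELS = {"a": (0, 0), "b": (0, 1), "c": (0, 1), "d": (1, 0), "e": (1, 1)}
NODE_ARROWS = {("a", "b"), ("a", "c"), ("d", "c"), ("d", "e")}

def models_of_p():
    """S_p: the nodes where p has value 1."""
    return {s for s, (p, _) in LABELS.items() if p == 1}

def minimal(X):
    """Elements of X with no element of X below them."""
    return {x for x in X if not any((y, x) in NODE_ARROWS for y in X)}

def A2_p_entails_q():
    """A_2: p |~ q iff q has value 1 at all minimal points of S_p."""
    return all(LABELS[s][1] == 1 for s in minimal(models_of_p()))
```

Here e is not minimal in $S_p$ because d lies below it, and the single minimal point d falsifies q.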

Note that in the cases of modal logic and nonmonotonic logic we ignored the arrows (d, (a, c)) (i.e. the double arrow from
d to the arc (a, c)) and the h values to arcs. These values do not play a part in the traditional modal or nonmonotonic
logic. They do play a part in other logics. The attentive reader may already suspect that we have here an opportunity for
generalisation of, say, nonmonotonic logic, by giving a role to arc annotations.



Argumentation Nets 

Here the nodes of $S$ are interpreted as arguments. The atoms $\{p, q\}$ can be interpreted as types of arguments and the
arrows e.g. $(a, b) \in \Re$ as indicating that the argument a is attacking the argument b.

So, for example, let 

a = we must win votes. 

b = death sentence for murderers. 

c = We must allow abortion for teenagers 

d = Bible forbids taking of life. 

q = the argument is a social argument 

p = the argument is a religious argument. 

(d, (a, c)) = there should be no connection between winning votes and abortion. 

((a, b), (d, c)) = If we attack the death sentence in order to win votes then we must stress (attack) that there 
should be no connection between religion (Bible) and social issues. 

Thus we have according to this model that supporting abortion can lose votes. The argument for abortion is a social one 
and the argument from the Bible against it is a religious one. 

We can extract information from this IBRS using two algorithms. The modal logic one can check whether for example 
every social argument is attacked by a religious argument. The answer is no, since the social argument b is attacked only 
by a which is not a religious argument. 

We can also use algorithm A3 (following Dung) to extract the winning arguments of this system. The arguments a and d 
are winning since they are not attacked, d attacks the connection between a and c (i.e. stops a attacking c). 

The attack of a on b is successful and so b is out. However the arc (a, b) attacks the arc (d,c). So c is not attacked at all 
as both arcs leading into it are successfully eliminated. So c is in. e is out because it is attacked by d. 

So the winning arguments are {a, c, d} 
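
The reasoning just given can be reproduced by a short recursive sketch. It is one possible reading of the procedure, not a definitive rendering of $A_3$: an arrow can act only if its origin is in, any object (node or arrow) is out as soon as some acting arrow points at it, and the network is assumed to be acyclic so that the recursion terminates. The names `alive` and `winning` are illustrative.

```python
from functools import lru_cache

NODES = ("a", "b", "c", "d", "e")
ARROWS = (
    ("a", "b"), ("a", "c"), ("d", "c"), ("d", "e"),
    (("a", "b"), ("d", "c")),  # the arc (a, b) attacks the arc (d, c)
    ("d", ("a", "c")),         # the node d attacks the arc (a, c)
)

def is_arrow(x):
    return isinstance(x, tuple)

@lru_cache(maxsize=None)
def alive(x):
    """True iff the object x (node or arrow) survives; assumes no cycles."""
    # An arrow can only act if its origin is alive.
    if is_arrow(x) and not alive(x[0]):
        return False
    # Any object is defeated by a live arrow pointing at it.
    return not any(alive(arr) for arr in ARROWS if arr[1] == x)

winning = {n for n in NODES if alive(n)}
```

On this network the arcs (a, c) and (d, c) are both defeated, so c survives, while b and e are defeated by live arcs.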

In this model we ignore the annotations on arcs. To be consistent in our mathematics we need to say that h is a partial
function on $\Re$. The best way is to give a more specific definition of IBRS to make it suitable for each logic.

Counterfactuals

The traditional semantics for counterfactuals involves closeness of worlds. The clause for $y \models p \hookrightarrow q$, where $\hookrightarrow$ is a
counterfactual implication, is that q holds in all worlds y' "near enough" to y in which p holds. So if we interpret the annotation
on arcs as distances then we can define "near" as distance $\le 2$; we get: $a \models p \hookrightarrow q$ iff in all worlds of p-distance $\le 2$, if p
holds so does q. Note that the distance depends on p.

In this case we get that $a \models p \hookrightarrow q$ holds. The distance function can also use the arrows from arcs to arcs, etc. There are
many opportunities for generalisation in our IBRS set up.

Intuitionistic Persistence 

We can get an intuitionistic Kripke model out of this IBRS by letting, for $t, s \in S$, $t \rho_0 s$ iff $t = s$ or $[tRs \wedge \forall q \in Q (h(q, t) \le
h(q, s))]$. We get that

[ $\rho_0 = \{(y, y) \mid y \in S\} \cup \{(a, b), (a, c), (d, e)\}$. ]

Let $\rho$ be the transitive closure of $\rho_0$. Algorithm $A_4$ evaluates $p \Rightarrow q$ in this model, where $\Rightarrow$ is intuitionistic implication.
$A_4$: $p \Rightarrow q$ holds at the IBRS iff $p \Rightarrow q$ holds intuitionistically at every $\rho$-minimal point of $(S, \rho)$.
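
The relation $\rho_0$ can be recomputed from node labels. The labels below are partly assumptions: the text confirms that d and e satisfy p and that h(q, d) = 0, and the other (p, q) values are read off the partly legible diagram. With these values the computed relation agrees with the one stated in the text.

```python
# (p, q) labels per node; values for a, b, c, e are assumptions (see lead-in).
LABELS = {"a": (0, 0), "b": (0, 1), "c": (0, 1), "d": (1, 0), "e": (1, 1)}
NODE_ARROWS = {("a", "b"), ("a", "c"), ("d", "c"), ("d", "e")}
S = set(LABELS)

def persists(t, s):
    """Every atom's value at t is at most its value at s."""
    return all(x <= y for x, y in zip(LABELS[t], LABELS[s]))

rho0 = {(y, y) for y in S} | {(t, s) for t, s in NODE_ARROWS if persists(t, s)}

def transitive_closure(rel):
    closure = set(rel)
    while True:
        extra = {(x, w) for x, y in closure for z, w in closure if y == z} - closure
        if not extra:
            return closure
        closure |= extra

rho = transitive_closure(rho0)
minimal = {y for y in S if not any(x != y and (x, y) in rho for x in S)}

def p_implies_q_at(t):
    """Intuitionistic p => q at t: q holds at every rho-successor where p holds."""
    return all(LABELS[s][1] == 1 for s in S if (t, s) in rho and LABELS[s][0] == 1)

A4 = all(p_implies_q_at(t) for t in minimal)
```

The arrow (d, c) drops out because p would fall from 1 to 0 along it. Under the assumed labels $p \Rightarrow q$ holds vacuously at a but fails at d, so $A_4$ rejects it.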

1.4.2 The power of IBRS 

We show now how a number of logics fit into our general picture of IBRS. 

(1) Nonmonotonic logics in the form of preferential logics: 

There are only arrows from nodes to nodes, and they are unlabelled. The nodes are classical models, and as such all 
propositional variables of the base language are given a value from {0, 1}. 

The structure is used as described above, i.e. the R-minimal models of a formula or theory are considered.

(2) Theory Revision 

In the full case, i.e. where the left hand side can change, nodes are again classical models, arrows exist only between
nodes, and express by their label the distance between nodes. Thus, there is just one (dummy) p, and a real value
as label. In the AGM situation, where the left hand side is fixed, nodes are classical models (on the right) or sets
thereof (on the left), arrows go from sets of models to models, and express again distance from a set (the K-models
in AGM notation) to a model (of the new formula $\phi$).
The structure is used by considering the closest $\phi$-models.

The framework is sufficiently general to express revision also differently: Nodes are pairs of classical models, and 
arrows express that in pair (a, b) the distance from a to b is smaller than the distance in the pair (a', b'). 

(3) Theory Update 

As developments of length 2 can be expressed by a binary relation and the distance associated, we can - at least in 
the simple case - proceed analogously to the first revision situation. It seems, however, more natural to consider as 
nodes threads of developments, i.e. sequences of classical models, as arrows comparisons between such threads, i.e. 
unlabelled simple arrows only, expressing that one thread is more natural or likely than another. 

The evaluation is then by considering the "best" threads under above comparison, and taking a projection on the 
desired coordinate (i.e. classical model). The result is then the theory defined by these projections. 

(4) Deontic Logic 

Just as for preferential logics. 

(5) The Logic of Counterfactual Conditionals 

Again, we can compare pairs (with same left element) as above, or, alternatively, compare single models with respect 
to distance from a fixed other model. This would give arrows with indices, which stand for this other model. 

Evaluation will then be as usual, taking the closest $\phi$-models, and examining whether $\psi$ holds in them.

(6) Modal Logic 

Nodes are classical models, and thus have the usual labels, arrows are unlabelled, and only between nodes, and 
express reachability. 

For evaluation, starting from some point, we collect all reachable other models, perhaps adding the point of departure. 

(7) Intuitionistic Logic 
Just as for modal logic. 

(8) Inheritance Systems 

Nodes are properties (or sets of models), arrows come in two flavours, positive and negative, and exist between nodes 
only. 

The evaluation is relatively complicated, and the subject of ongoing discussion. 

(9) Argumentation Theory 

There is no unique description of an argumentation system as an IBRS. For instance, an inheritance system is an 
argumentation system, so we can describe such a system as detailed above. But an argument can also be a deontic 
statement, as we saw in the first part of this introduction, and a deontic statement can be described as an IBRS 
itself. Thus, a node can be, under finer granularity, itself an IBRS. Labels can describe the type of argument (social,

1.4.3 Abstract semantics for IBRS and its engineering realization
1.4.3.1 Introduction 

(1) Nodes and arrows 

As we may have counterarguments not only against nodes, but also against arrows, they must be treated basically the 
same way, i.e. in some way there has to be a positive, but also a negative influence on both. So arrows cannot just be 
concatenation between the contents of nodes, or so. 

We will differentiate between nodes and arrows by labelling arrows in addition with a time delay. We see nodes as situations,
where the output is computed instantaneously from the input, whereas arrows describe some "force" or "mechanism" which
may need some time to "compute" the result from the input.

Consequently, if $\alpha$ is an arrow, and $\beta$ an arrow pointing to $\alpha$, then $\beta$ should point to the input of $\alpha$, i.e. before the time
lapse. Conversely, any arrow originating in $\alpha$ should originate after the time lapse.

Apart from this distinction, we will treat nodes and arrows the same way, so the following discussion will apply to both - which
we call just "objects".

(2) Defeasibility 

The general idea is to code each object, say X, by $I(X) : U(X) \to C(X)$: If I(X) holds then, unless U(X) holds,
consequence C(X) will hold. (We adopted Reiter's notation for defaults, as IBRS have common points with the former.)

The situation is slightly more complicated, as there can be several counterarguments, so U(X) really is an "or". Likewise,
there can be several supporting arguments, so I(X) also is an "or".

A counterargument need not always be an argument against a specific supporting argument, but it can be. Thus, we should
admit both possibilities. As we can use arrows to arrows, the second case is easy to treat (as is the dual, a supporting
argument can be against a specific counterargument). How do we treat the case of unspecific pro- and counterarguments?
Probably the easiest way is to adopt Dung's idea: an object is in, if it has at least one support, and no counterargument
- see [Dun95]. Of course, other possibilities may be adopted, counting, use of labels, etc., but we just consider the simple
case here.
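
The simple acceptance rule just described can be sketched in a few lines. This is only an illustration under the assumption that the supporting and attacking inputs of an object are given as lists of truth values; the names `is_in`, `supports` and `attacks` are hypothetical and do not come from the text.

```python
def is_in(supports, attacks):
    """An object is 'in' iff at least one supporting input is valid
    and no counterargument is valid (I(X) and U(X) read as disjunctions)."""
    return any(supports) and not any(attacks)


print(is_in([False, True], [False]))  # one valid support, no valid attack
print(is_in([True], [False, True]))   # a counterargument is valid
print(is_in([], []))                  # no support at all
```

Counting arguments or weighting them by labels, as mentioned above, would replace the body of `is_in` by a different aggregation.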

(3) Labels 

In the general case, objects stand for some kind of defeasible transmission. We may in some cases see labels as restricting 
this transmission to certain values. For instance, if the label is p = 1 and q = 0, then the p-part may be transmitted and
the q-part not.

Thus, a transmission with a label can sometimes be considered as a family of transmissions, which ones are active is 
indicated by the label. 

Example 1.4.1 

In fuzzy Kripke models, labels are elements of [0, 1]. p = 0.5 as label for a node m' which stands for a fuzzy model means
that the value of p is 0.5. p = 0.5 as label for an arrow from m to m' means that p is transmitted with value 0.5. Thus,
when we look from m to m', we see p with value 0.5 * 0.5 = 0.25. So, we have $\Diamond p$ with value 0.25 at m - if, e.g., m, m' are
the only models.
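
The arithmetic of this example can be replayed with a small sketch. It assumes that the value seen along an arrow is the product of the arrow label and the node label, as in the example, and additionally assumes (this is not stated in the text) that the possibility operator takes the maximum over all reachable models. The function name `diamond` and the dictionary layout are illustrative only.

```python
def diamond(node_value, arrows):
    """node_value: model -> value of p in [0, 1];
    arrows: successor model -> transmission factor in [0, 1].
    Assumption: product along an arrow, maximum over successors."""
    return max((w * node_value[t] for t, w in arrows.items()), default=0.0)


node_value = {"m'": 0.5}      # p has value 0.5 at m'
arrows_from_m = {"m'": 0.5}   # the arrow m -> m' transmits p with factor 0.5
print(diamond(node_value, arrows_from_m))
```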

(4) Putting things together 

If an arrow leaves an object, the object's output will be connected to the (only) positive input of the arrow. (An arrow 
has no negative inputs from objects it leaves.) If a positive arrow enters an object, it is connected to one of the positive 
inputs of the object, analogously for negative arrows and inputs. 

When labels are present, they are transmitted through some operation. 



1.4.3.2 Formal definition 
Definition 1.4.2 

In the most general case, objects of IBRS have the form: $((I_1, L_1), \ldots, (I_n, L_n)) : ((U_1, L'_1), \ldots, (U_n, L'_n))$, where the $L_i, L'_i$
are labels and the $I_i, U_i$ might be just truth values, but can also be more complicated, a (possibly infinite) sequence of
some values. Connected objects have, of course, to have corresponding such sequences. In addition, the object X has a
criterion for each input, whether it is valid or not (in the simple case, this will just be the truth value "true"). If there
is at least one positive valid input $I_i$, and no valid negative input $U_i$, then the output C(X) and its label are calculated
on the basis of the valid inputs and their labels. If the object is an arrow, this will take some time, t, otherwise, this is
instantaneous.

Evaluating a diagram 

An evaluation is relative to a fixed input, i.e. some objects will be given certain values, and the diagram is left to calculate 
the others. It may well be that it oscillates, i.e. shows a cyclic behaviour. This may be true for a subset of the diagram, 
or the whole diagram. If it is restricted to an unimportant part, we might neglect this. Whether it oscillates or not can
also depend on the time delays of the arrows (see Example 1.4.2).



We therefore define for a diagram $\Delta$: $\alpha \mathrel{|\!\sim}_{\Delta} \beta$ iff

(a) $\alpha$ is a (perhaps partial) input - where the other values are set "not valid",

(b) $\beta$ is a (perhaps partial) output,

(c) after some time, $\beta$ is stable, i.e. all still possible oscillations do not affect $\beta$,

(d) the other possible input values do not matter, i.e. whatever the input, the result is the same.

In the cases examined here more closely, all input values will be defined.



1.4.3.3 A circuit semantics for simple IBRS without labels 



It is standard to implement the usual logical connectives by electronic circuits. These components are called gates. Circuits 
with feedback sometimes show undesirable behaviour when the initial conditions are not specified. (When we switch a 
circuit on, the outputs of the individual gates can have arbitrary values.) The technical realization of these initial values 
shows the way to treat defaults. The initial values are set via resistors (in the order of 1 kΩ) between the point in the
circuit we want to initialize and the desired voltage (say 0 Volt for false, 5 Volt for true). They are called pull-down or
pull-up resistors (for default 0 or 5 Volt). When a "real" result comes in, it will override the voltage applied via the resistor.

Closer inspection reveals that we have here a 3 level default situation: The initial value will be the weakest, which can be 
overridden by any "real" signal, but a positive argument can be overridden by a negative one. Thus, the biggest resistor 
will be for the initialization, the smaller one for the supporting arguments, and the negative arguments have full power. 

Technical details will be left to the experts. 

We give now an example which shows that the delays of the arrows can matter. In one situation, a stable state is reached, 
in another, the circuit begins to oscillate. 



Example 1.4.2 

(In engineering terms, this is a variant of a JK flip-flop with R * S = 0, a circuit with feedback.) 
We have 8 measuring points. 

In1, In2 are the overall input, Out1, Out2 the overall output, A1, A2, A3, A4 are auxiliary internal points. All points can
be true or false.

The logical structure is as follows:
$A1 = In1 \wedge Out1$, $A2 = In2 \wedge Out2$,
$A3 = A1 \vee Out2$, $A4 = A2 \vee Out1$,
$Out1 = \neg A3$, $Out2 = \neg A4$.

Thus, the circuit is symmetrical, with In1 corresponding to In2, A1 to A2, A3 to A4, Out1 to Out2.
The input is held constant. See Diagram 1.4.2.
Diagram 1.4.2: Gate Semantics



We suppose that the output of the individual gates is present n time slices after the input was present. n will in the first
circuit be equal to 1 for all gates, in the second circuit equal to 1 for all but the AND gates, which will take 2 time slices.
Thus, in both cases, e.g. Out1 at time t will be the negation of A3 at time t - 1. In the first case, A1 at time t will be the
conjunction of In1 and Out1 at time t - 1, and in the second case the conjunction of In1 and Out1 at time t - 2.

We initialize In1 as true, all others as false. (The initial value of A3 and A4 does not matter, the behaviour is essentially
the same for all such values.)

The first circuit will oscillate with a period of 4, the second circuit will go to a stable state.
We have the following transition tables (time slice shown at left): 
Circuit 1, delay = 1 everywhere:

      In1   In2   A1    A2    A3    A4    Out1  Out2
1     T     F     F     F     F     F     F     F
2     T     F     F     F     F     F     T     T
3     T     F     T     F     T     T     T     T
4     T     F     T     F     T     T     F     F
5     T     F     F     F     T     F     F     F      oscillation starts
6     T     F     F     F     F     F     F     T
7     T     F     F     F     T     F     T     T
8     T     F     T     F     T     T     F     T
9     T     F     F     F     T     F     F     F      back to start of oscillation



Circuit 2, delay = 1 everywhere, except for AND with delay = 2:

      In1   In2   A1    A2    A3    A4    Out1  Out2
1     T     F     F     F     F     F     F     F
2     T     F     F     F     F     F     T     T
3     T     F     F     F     T     T     T     T
4     T     F     T     F     T     T     F     F
5     T     F     T     F     T     F     F     F
6     T     F     F     F     T     F     F     T
7     T     F     F     F     T     F     F     T
8     T     F     F     F     T     F     F     T      stable state reached



Note that state 6 of circuit 2 is also stable in circuit 1, but it is never reached in that circuit. 
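
The two transition tables can be recomputed mechanically. The following sketch simulates the gate equations of Example 1.4.2 in discrete time slices; it assumes that signals before time slice 1 have their initial values, and the function name `simulate` and its parameter `and_delay` are illustrative choices, not notation from the text.

```python
NAMES = ["In1", "In2", "A1", "A2", "A3", "A4", "Out1", "Out2"]


def simulate(and_delay, steps=9):
    """Return the states at time slices 1..steps as tuples ordered like NAMES.
    OR and NOT gates have delay 1; AND gates have delay `and_delay`."""
    init = dict.fromkeys(NAMES, False)
    init["In1"] = True
    history = [init]  # history[0] is time slice 1

    def ago(k):
        # State k slices before the one being computed; clamped to the
        # initial state (assumption: values before slice 1 equal slice 1).
        return history[max(len(history) - k, 0)]

    for _ in range(steps - 1):
        prev, prev_and = ago(1), ago(and_delay)
        history.append({
            "In1": prev["In1"],                       # input is held constant
            "In2": prev["In2"],
            "A1": prev_and["In1"] and prev_and["Out1"],
            "A2": prev_and["In2"] and prev_and["Out2"],
            "A3": prev["A1"] or prev["Out2"],
            "A4": prev["A2"] or prev["Out1"],
            "Out1": not prev["A3"],
            "Out2": not prev["A4"],
        })
    return [tuple(state[n] for n in NAMES) for state in history]


for label, delay in (("Circuit 1", 1), ("Circuit 2", 2)):
    print(label)
    for t, row in enumerate(simulate(delay), start=1):
        print(t, " ".join("T" if v else "F" for v in row))
```

Running it with `and_delay=1` reproduces the oscillation of period 4 between time slices 5 and 9, and with `and_delay=2` the state stops changing from time slice 6 on.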



Chapter 2 

Basic definitions and results 



2.1 Algebraic definitions 

Notation 2.1.1 

We use sometimes FOL as abbreviation for first order logic, and NML for nonmonotonic logic. To avoid Latex complications 
in bigger expressions, we replace xxxxx by xxxxx. 

Definition 2.1.1 

We use $\mathcal{P}$ to denote the power set operator, $\Pi\{X_i : i \in I\} := \{g : g : I \to \bigcup\{X_i : i \in I\}, \forall i \in I.\ g(i) \in X_i\}$ is the general
cartesian product, $card(X)$ shall denote the cardinality of $X$, and $V$ the set-theoretic universe we work in - the class of all
sets. Given a set of pairs $\mathcal{X}$, and a set $X$, we denote by $\mathcal{X} \lceil X := \{\langle x, i \rangle \in \mathcal{X} : x \in X\}$. When the context is clear, we will
sometimes simply write $X$ for $\mathcal{X} \lceil X$. (The intended use is for preferential structures, where $x$ will be a point (intention: a
classical propositional model), and $i$ an index, permitting copies of logically identical points.)

$A \subseteq B$ will denote that $A$ is a subset of $B$ or equal to $B$, and $A \subset B$ that $A$ is a proper subset of $B$, likewise for $A \supseteq B$
and $A \supset B$.

Given some fixed set $U$ we work in, and $X \subseteq U$, then $C(X) := U - X$.

If $\mathcal{Y} \subseteq \mathcal{P}(X)$ for some $X$, we say that $\mathcal{Y}$ satisfies

$(\cap)$ iff it is closed under finite intersections,

$(\bigcap)$ iff it is closed under arbitrary intersections,

$(\cup)$ iff it is closed under finite unions,

$(\bigcup)$ iff it is closed under arbitrary unions,

$(C)$ iff it is closed under complementation,

$(-)$ iff it is closed under set difference.

We will sometimes write $A = B \parallel C$ for: $A = B$, or $A = C$, or $A = B \cup C$.
We make ample and tacit use of the Axiom of Choice. 

Definition 2.1.2 

$\prec^*$ will denote the transitive closure of the relation $\prec$. If a relation $<$, $\prec$, or similar is given, $a \bot b$ will express that $a$ and
$b$ are $<$- (or $\prec$-) incomparable - context will tell. Given any relation $<$, $\leq$ will stand for $<$ or $=$, conversely, given $\leq$,
$<$ will stand for $\leq$, but not $=$, similarly for $\prec$ etc.

Definition 2.1.3 

A child (or successor) of an element x in a tree t will be a direct child in t. A child of a child, etc. will be called an indirect
child. Trees will be supposed to grow downwards, so the root is the top element. 

Definition 2.1.4 

A subsequence $\sigma_i : i \in I \subseteq \mu$ of a sequence $\sigma_i : i \in \mu$ is called cofinal, iff for all $i \in \mu$ there is $i' \in I$ with $i \leq i'$.

Given two sequences $\sigma_i$ and $\tau_i$ of the same length, then their Hamming distance is the quantity of $i$ where they differ.
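
For finite sequences, the Hamming distance defined above amounts to counting the positions at which the two sequences disagree. A minimal sketch (the function name `hamming` is an illustrative choice):

```python
def hamming(sigma, tau):
    """Number of indices i at which sigma[i] and tau[i] differ."""
    if len(sigma) != len(tau):
        raise ValueError("sequences must have the same length")
    return sum(1 for s, t in zip(sigma, tau) if s != t)


print(hamming("TFFT", "TTFF"))  # positions 1 and 3 differ
```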

Definition 2.1.5 

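Let $\mathcal{Y} \subseteq \mathcal{P}(Z)$ be given and closed under arbitrary intersections.

(1) For $A \subseteq Z$, let $\overbrace{A} := \bigcap\{X \in \mathcal{Y} : A \subseteq X\}$.

(2) For $B \in \mathcal{Y}$, we call $A \subseteq B$ a small subset of $B$ iff there is no $X \in \mathcal{Y}$ such that $B - A \subseteq X \subset B$.
(Context will disambiguate from other uses of "small".)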
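
For a finite family of sets, both notions of this definition can be computed directly. The sketch below is only an illustration: the names `hull` and `is_small` are hypothetical, sets are represented as `frozenset`s, and the empty intersection is taken to be the whole of Z (an assumption that only matters when no member of the family contains A).

```python
from functools import reduce


def hull(A, family, Z):
    """Intersection of all members of `family` that contain A."""
    return reduce(frozenset.__and__, (X for X in family if A <= X), frozenset(Z))


def is_small(A, B, family):
    """A is a small subset of B iff no X in the family has B - A <= X < B."""
    return not any((B - A) <= X < B for X in family)


Z = frozenset({1, 2, 3})
# A chain of sets, hence closed under intersections.
family = [frozenset(), frozenset({1}), frozenset({1, 2}), Z]

print(sorted(hull(frozenset({2}), family, Z)))   # smallest member containing 2
print(is_small(frozenset({3}), Z, family))       # {1, 2} lies strictly between
print(is_small(frozenset({1}), Z, family))       # nothing fits between {2, 3} and Z
```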

Intuitively, $Z$ is the set of all models for $\mathcal{L}$, $\mathcal{Y}$ is $D_{\mathcal{L}}$, and $\overbrace{A} = M(Th(A))$, this is the intended application - $Th(A)$ is

Fact 2.1.1

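1) If $\mathcal{Y} \subseteq \mathcal{P}(Z)$ is closed under arbitrary intersections and finite unions, $Z \in \mathcal{Y}$, $X, Y \subseteq Z$, then the following hold:

$(Cl\cup)$ $\overbrace{X \cup Y} = \overbrace{X} \cup \overbrace{Y}$,

$(Cl\cap)$ $\overbrace{X \cap Y} \subseteq \overbrace{X} \cap \overbrace{Y}$, but usually not conversely,

$(Cl-)$ $\overbrace{A} - \overbrace{B} \subseteq \overbrace{A - B}$,

$(Cl=)$ $X = Y \Rightarrow \overbrace{X} = \overbrace{Y}$, but not conversely,

$(Cl\subseteq 1)$ $\overbrace{X} \subseteq Y \Rightarrow X \subseteq Y$, but not conversely,

$(Cl\subseteq 2)$ $X \subseteq \overbrace{Y} \Rightarrow \overbrace{X} \subseteq \overbrace{Y}$.

2) If, in addition, $X \in \mathcal{Y}$ and $CX := Z - X \in \mathcal{Y}$, then the following two properties hold, too:

$(Cl\cap+)$ $\overbrace{A} \cap X = \overbrace{A \cap X}$,

$(Cl-+)$ $\overbrace{A} - X = \overbrace{A - X}$.

3) In the intended application, i.e. $\overbrace{A} = M(Th(A))$, the following hold:

3.1) $Th(X) = Th(\overbrace{X})$,

3.2) Even if $A = \overbrace{A}$, $B = \overbrace{B}$, it is not necessarily true that $\overbrace{A - B} \subseteq \overbrace{A} - \overbrace{B}$.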
Proof 

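$(Cl=)$, $(Cl\subseteq 1)$, $(Cl\subseteq 2)$, (3.1) are trivial.

$(Cl\cup)$ Let $\mathcal{Y}(U) := \{X \in \mathcal{Y} : U \subseteq X\}$. If $A \in \mathcal{Y}(X \cup Y)$, then $A \in \mathcal{Y}(X)$ and $A \in \mathcal{Y}(Y)$, so $\overbrace{X \cup Y} \supseteq \overbrace{X} \cup \overbrace{Y}$. If
$A \in \mathcal{Y}(X)$ and $B \in \mathcal{Y}(Y)$, then $A \cup B \in \mathcal{Y}(X \cup Y)$, so $\overbrace{X \cup Y} \subseteq \overbrace{X} \cup \overbrace{Y}$.

$(Cl\cap)$ Let $X', Y' \in \mathcal{Y}$, $X \subseteq X'$, $Y \subseteq Y'$, then $X \cap Y \subseteq X' \cap Y'$, so $\overbrace{X \cap Y} \subseteq \overbrace{X} \cap \overbrace{Y}$. For the converse, set $X := M_{\mathcal{L}} - \{m\}$,
$Y := \{m\}$ in Example 2.2.1. ($M_{\mathcal{L}}$ is the set of all models of the language $\mathcal{L}$.)

$(Cl-)$ Let $A - B \subseteq X \in \mathcal{Y}$, $B \subseteq Y \in \mathcal{Y}$, so $A \subseteq X \cup Y \in \mathcal{Y}$. Let $x \notin \overbrace{B}$ $\Rightarrow$ $\exists Y \in \mathcal{Y} (B \subseteq Y, x \notin Y)$, $x \notin \overbrace{A - B}$
$\Rightarrow$ $\exists X \in \mathcal{Y} (A - B \subseteq X, x \notin X)$, so $x \notin X \cup Y$, $A \subseteq X \cup Y$, so $x \notin \overbrace{A}$. Thus, $x \notin \overbrace{B}$, $x \notin \overbrace{A - B}$ $\Rightarrow$ $x \notin \overbrace{A}$, or
$x \in \overbrace{A} - \overbrace{B}$ $\Rightarrow$ $x \in \overbrace{A - B}$.

$(Cl\cap+)$ $\overbrace{A} \cap X \supseteq \overbrace{A \cap X}$ by $(Cl\cap)$. For "$\subseteq$": Let $A \cap X \subseteq A' \in \mathcal{Y}$, then by closure under $(\cup)$, $A \subseteq A' \cup CX \in \mathcal{Y}$,
$(A' \cup CX) \cap X \subseteq A'$. So $\overbrace{A} \cap X \subseteq \overbrace{A \cap X}$.

$(Cl-+)$ $\overbrace{A - X} = \overbrace{A \cap CX} = \overbrace{A} \cap CX = \overbrace{A} - X$ by $(Cl\cap+)$.

(3.2) Set $A := M_{\mathcal{L}}$, $B := \{m\}$ for $m \in M_{\mathcal{L}}$ arbitrary, $\mathcal{L}$ infinite. So $A = \overbrace{A}$, $B = \overbrace{B}$, but $\overbrace{A - B} = A \neq A - B$.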
□ 



2.2 Basic logical definitions 

Definition 2.2.1 

We work here in a classical propositional language $\mathcal{L}$, a theory $T$ will be an arbitrary set of formulas. Formulas will often
be named $\phi$, $\psi$, etc., theories $T$, $S$, etc.

$v(\mathcal{L})$ will be the set of propositional variables of $\mathcal{L}$.

$M_{\mathcal{L}}$ will be the set of (classical) models for $\mathcal{L}$, $M(T)$ or $M_T$ is the set of models of $T$, likewise $M(\phi)$ for a formula $\phi$.
$D_{\mathcal{L}} := \{M(T) : T$ a theory in $\mathcal{L}\}$, the set of definable model sets.

Note that, in classical propositional logic, $\emptyset, M_{\mathcal{L}} \in D_{\mathcal{L}}$, $D_{\mathcal{L}}$ contains singletons, is closed under arbitrary intersections and
finite unions.

An operation $f : \mathcal{Y} \to \mathcal{P}(M_{\mathcal{L}})$ for $\mathcal{Y} \subseteq \mathcal{P}(M_{\mathcal{L}})$ is called definability preserving, $(dp)$ or $(\mu dp)$ in short, iff for all $X \in D_{\mathcal{L}} \cap \mathcal{Y}$
$f(X) \in D_{\mathcal{L}}$.
$\vdash$ will be classical derivability, and
$\overline{T} := \{\phi : T \vdash \phi\}$, the closure of $T$ under $\vdash$.

$Con(.)$ will stand for classical consistency, so $Con(\phi)$ will mean that $\phi$ is classically consistent, likewise for $Con(T)$. $Con(T, T')$
will stand for $Con(T \cup T')$, etc.

Given a consequence relation $\mathrel{|\!\sim}$, we define
$\overline{\overline{T}} := \{\phi : T \mathrel{|\!\sim} \phi\}$.

(There is no fear of confusion with $\overline{T}$, as it just is not useful to close twice under classical logic.)
$T \vee T' := \{\phi \vee \phi' : \phi \in T, \phi' \in T'\}$.

If $X \subseteq M_{\mathcal{L}}$, then $Th(X) := \{\phi : X \models \phi\}$, likewise for $Th(m)$, $m \in M_{\mathcal{L}}$. ($\models$ will usually be classical validity.)
We recollect and note: 
Fact 2.2.1 

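Let $\mathcal{L}$ be a fixed propositional language, $D_{\mathcal{L}} \subseteq X$, $\mu : X \to \mathcal{P}(M_{\mathcal{L}})$, for a $\mathcal{L}$-theory $T$ let $\overline{\overline{T}} := Th(\mu(M_T))$, let $T, T'$ be
arbitrary theories, then:

(1) $\mu(M_T) \subseteq M_{\overline{\overline{T}}}$,

(2) $M_T \cup M_{T'} = M_{T \vee T'}$ and $M_{T \cup T'} = M_T \cap M_{T'}$,

(3) $\mu(M_T) = \emptyset \Leftrightarrow \bot \in \overline{\overline{T}}$.

If $\mu$ is definability preserving or $\mu(M_T)$ is finite, then the following also hold:

(4) $\mu(M_T) = M_{\overline{\overline{T}}}$,

(5) $T' \vdash \overline{\overline{T}} \Leftrightarrow M_{T'} \subseteq \mu(M_T)$,

(6) $\mu(M_T) = M_{T'} \Leftrightarrow \overline{T'} = \overline{\overline{T}}$. □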

Fact 2.2.2 

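Let $A, B \subseteq M_{\mathcal{L}}$.

Then $Th(A \cup B) = Th(A) \cap Th(B)$.
Proof

$\phi \in Th(A \cup B) \Leftrightarrow A \cup B \models \phi \Leftrightarrow A \models \phi$ and $B \models \phi \Leftrightarrow \phi \in Th(A)$ and $\phi \in Th(B)$.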
□ 
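
Fact 2.2.2 can be checked exhaustively for a small language. The sketch below assumes a language with two propositional variables and identifies each formula, up to logical equivalence, with its set of models; the names `models`, `formulas` and `th` are illustrative.

```python
from itertools import combinations, product

# The four valuations of two propositional variables.
models = list(product([False, True], repeat=2))
# Up to equivalence, a formula is determined by its model set (16 in total).
formulas = [frozenset(c) for r in range(len(models) + 1)
            for c in combinations(models, r)]


def th(X):
    """All formulas (as model sets) that hold in every model of X."""
    return {phi for phi in formulas if X <= phi}


# Every model set is one of the 16 sets above, so they double as A and B.
holds = all(th(A | B) == th(A) & th(B) for A in formulas for B in formulas)
print(holds)
```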



Fact 2.2.3 

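Let $X \subseteq M_{\mathcal{L}}$, $\phi$, $\psi$ formulas.

(1) $X \cap M(\phi) \models \psi$ iff $X \models \phi \to \psi$.

(2) $X \cap M(\phi) \models \psi$ iff $M(Th(X)) \cap M(\phi) \models \psi$.

(3) $Th(X \cap M(\phi)) = \overline{Th(X) \cup \{\phi\}}$.

(4) $X \cap M(\phi) = \emptyset \Leftrightarrow M(Th(X)) \cap M(\phi) = \emptyset$.

(5) $Th(M(T) \cap M(T')) = \overline{T \cup T'}$.

Proof

(1) "$\Rightarrow$": $X = (X \cap M(\phi)) \cup (X \cap M(\neg\phi))$. In both parts holds $\neg\phi \vee \psi$, so $X \models \phi \to \psi$. "$\Leftarrow$": Trivial.

(2) $X \cap M(\phi) \models \psi$ (by (1)) iff $X \models \phi \to \psi$ iff $M(Th(X)) \models \phi \to \psi$ iff (again by (1)) $M(Th(X)) \cap M(\phi) \models \psi$.

(3) $\psi \in Th(X \cap M(\phi)) \Leftrightarrow X \cap M(\phi) \models \psi \Leftrightarrow_{(2)} M(Th(X) \cup \{\phi\}) = M(Th(X)) \cap M(\phi) \models \psi \Leftrightarrow Th(X) \cup \{\phi\} \vdash \psi$.

(4) $X \cap M(\phi) = \emptyset \Leftrightarrow X \models \neg\phi \Leftrightarrow M(Th(X)) \models \neg\phi \Leftrightarrow M(Th(X)) \cap M(\phi) = \emptyset$.

(5) $M(T) \cap M(T') = M(T \cup T')$.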

□ 



Fact 2.2.4 
Proof

$X \subseteq M(Th(X))$ is trivial. $Th(M(T)) = \overline{T}$ is trivial by classical soundness and completeness. So $M(Th(M(T))) = M(\overline{T}) =
M(T) = X$. □



Example 2.2.1 

If $v(\mathcal{L})$ is infinite, and $m$ any model for $\mathcal{L}$, then $M := M_{\mathcal{L}} - \{m\}$ is not definable by any theory $T$. (Proof: Suppose it
were, and let $\phi$ hold in $M$, but not in $m$, so in $m$ $\neg\phi$ holds, but as $\phi$ is finite, there is a model $m'$ in $M$ which coincides on
all propositional variables of $\phi$ with $m$, so in $m'$ $\neg\phi$ holds, too, a contradiction.) Thus, in the infinite case, $\mathcal{P}(M_{\mathcal{L}}) \neq D_{\mathcal{L}}$.

(There is also a simple cardinality argument, which shows that almost no model sets are definable, but it is not constructive
and thus less instructive than the above argument. We give it nonetheless: Let $\kappa := card(v(\mathcal{L}))$. Then there are $\kappa$ many
formulas, so $2^{\kappa}$ many theories, and thus $2^{\kappa}$ many definable model sets. But there are $2^{\kappa}$ many models, so $2^{2^{\kappa}}$ many
model sets.)

□ 



2.3 Basic definitions and results for nonmonotonic logics 



Logical rule | Correspondence | Model set | Correspondence | Size Rules


Basics 


(6'C) Supraclassicality 

h =^ k; 

\HtLt ) Rcncxivity 
T U {a} k a 


(6'C) 
T C T 


=> (4.D 
<= (4-2) 


(My 

/(X) C X 


trivial 


(Opt) 


(LLE) 
Left Logical Equivalence 


{LLE) 
_ = = 










(RW) Right Weakening 
|~ ip,\~ — » =>■ 
|~ 


(RW) 
TU' 






t r l vial 




(u>0.R) 

|~ V) l~ ^ 

V 0' ki/) 


(toOR) 


=> (3.1) 


/(x uy)c /(x) u y 


<= (i-i) 


(eMT) 


T n T' C T V T' 


<= (3-2) 


=> (1-2) 


{ (lis J (Jrl) 

1 '0 , |~ 0, 

V k 


(atsjUK) 
-^Con(TUT') => 

T n T" C T V T' 


_^ ^2 i\ 


{piaisjUri) 
X n Y = => 

/(XUY) C /(X)U/(Y) 


f4 l"l 


(I U disj) 


<= (2-2) 


=> (4-2) 


(CP) 

Consistency Preservation 

|~ _L =^ h ± 


(CP) 
T |~ _L => T h _L 


=> (5-1) 
<= (5-2) 


(m0) 

/(X) = => X = 


trivial 


CO 








(M0/'«) 
X / => /(X) 
for finite X 




(/i) 




(iWDi) 








U2) 




(AXD„) 
a |~ . . . . a [~ 0n — l 
ct W f — V V -ifl 1 1 

" I" V Ml v ... v >f->n — ll 










(AND) 

(p |~ , ~ =>■ 
|~ A 


(AN D) 
T (~ V> T (~ V' => 
T WAi/i' 






t r l vial 




{CCL) Classical Closure 


_ (CCL) 

T classically closed 






trivial 


(iM) + (J„) 


(OR) 

(f> \^ xjj. (p f ip 

V t/j 


(Oio 


=> (1-1) 


(jiOR) 
f(xuY) c /(x)u/(r) 


<= (2-1) 


(eMT) + (!„) 


T n T' C T V T' 


<= (1-2) 


=> (2-2) 






=* (6.D 


x c y => 
/(y)nx c /(X) 


*= (3.1) 


(eMI) + (J„) 


A 0' C U {0'} 


TUT'CTUT' 


<= (tidp) + ( M C) (6.2) 
if= without (fidp) (6.3) 
^(p C) (6.4) 
T a formula 


=> (3-2) 


<i= (6.5) 
T a formula 


( M Pff) 
/(x)nyc/(xny) 


(Ct/T) 
T ~ a; T U {a} |~ /3 
T h/3 


(Ct/T) 
T C T' C T =>■ 
T 7 C T 


=> (7-1) 


(juCt/T) 
/(X) C Y C X => 

/(X) c /(y) 


<= (8-1) 


(eMI) + (J„) 


<= (7-2) 


^ (8.2) 




Logical rule | Correspondence | Model set | Correspondence | Size-Rule


Cumulativity


(oit'M) 

a |~ /?, a' h a, a A /3 h a' a 7 (~ /3 








trivial 


(eM^) 


(CM 2 ) 

a k /9, a |~ /3' =^ a A /3 ^ tS' 












(CM n ) 
a y*j /3i, . . . , a (~ y9 n 
a A j9i A ... A 1/ -n^n 












(C A/) Cautious Monotony 


(CM) 


(8-1) 


(fiCM) 


^ (5.1) 


{M+){A) 




T C T 7 C T ^ 


<S= (8.2) 


f(X) CYCX ^ 


=> (5-2) 




A V j~ V 7 


T C T 7 




fOr) c /(x) 






or (KesM) .Restricted Monotony 




=>• (9.1) 


(juKesJW ) 






T |~ a, (3 T U {a} |~ (3 






/(x) anB=> /(x n A) c s 






{OUAl) Oumulativity 




(11.1) 


1/lOUM) 


■<= ( 9 -l) 


(eMX) + (i w ) + (AtJ)(4) 




T C T 7 C T 


<= (11-2) 


/(X) C Y C X => 


(9.2) 




ip' <p A ip |~ 


T = T 7 




/(y) = /(x) 










^ (10.1) 


(/* CD) 


<= (10.1) 


(eMZ) + (/„) + (eMf) 




T CT'. T' C T =>■ 


<= (10.2) 


/(x) c c x => 


5* (10.2) 






T 7 = T 




/(X) = /(F) 






Rationality 


{RatM) Rational Monotony 


(RatM) 


=> (12.1) 


(ixRatM) 


<= (6.D 


(M ++ ) 














(p A ip' |~ V 


T D T 7 u T 


j(= without f/irfp) (12.3) 


/(X) c/(y)nx 










T a formula {12 A) 










(RatM =) 


=>• (13.1) 


(/*=) 








Coti(TuF), T h T' =>- 


<= (jiirfp) (13.2) 


x c y. x n /(y) ?5 








T = ?UT 


j)t without (/t<ip) (13.3) 


/(x) = /(y)nx 










•4= 'J' a tormula (13.4) 










(Log =') 


=!> (14.1) 


(p =') 








Con(T> UT) => 


<= (MP) (14.2) 


/(y)nx#e^ 








T U T" = T' U T 


without {pdp) (14.3) 


/(y n X) = /(y) n X 










•4= '1' a tormula (14.4) 










(i°9 II) 


^ (15.1) 


(p II) 








T V T' is one of 


<= (15.2) 


/(X U y) is one of 








T, or T 7 , orfnF (by (CCL)) 




/(x), /(y) or /(x)u/(y) 








(LogU) 


(// c=) + (M =) (16.1) 


l>U) 








Con(T'UT), nCon(f 7 UT) 


(/*dp) (16.2) 


fOn n (x - /(X)) ^ => 








-.Con(T VT'U T') 


^ without (/top) (16.3) 


/(x u y) n y = 








(£ogU') 


=>• (m C) + (m =) (17.1) 


Ou') 








Con{T' U T), ^Con(T' U T) => 


<= bidp) (17.2) 


/(y)n(x-/(x))#0^ 








T V T 1 = T 


^ without (/t(ip) (17.3) 


/(xuy) = /(X) 












(m e) 

a e X - /(X) => 
3fc £ X.a /({a, 6}) 







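Basics
(1.1) | $(\mu PR)$ | $\Rightarrow$ $(\cap) + (\mu\subseteq)$ | $(\mu PR')$
(1.2) | $(\mu PR)$ | $\Leftarrow$ | $(\mu PR')$
(2.1) | $(\mu PR)$ | $\Rightarrow$ $(\mu\subseteq)$ | $(\mu OR)$
(2.2) | $(\mu PR)$ | $\Leftarrow$ $(\mu\subseteq) + (-)$ | $(\mu OR)$
(2.3) | $(\mu PR)$ | $\Rightarrow$ $(\mu\subseteq)$ | $(\mu wOR)$
(2.4) | $(\mu PR)$ | $\Leftarrow$ $(\mu\subseteq) + (-)$ | $(\mu wOR)$
(3) | $(\mu PR)$ | $\Rightarrow$ | $(\mu CUT)$
(4) | $(\mu\subseteq) + (\mu\subseteq\supseteq) + (\mu CUM) + (\mu RatM) + (\cap)$ | $\not\Rightarrow$ | $(\mu PR)$

Cumulativity
(5.1) | $(\mu CM)$ | $\Rightarrow$ $(\cap) + (\mu\subseteq)$ | $(\mu ResM)$
(5.2) | $(\mu CM)$ | $\Leftarrow$ (infin.) | $(\mu ResM)$
(6) | $(\mu CM) + (\mu CUT)$ | $\Leftrightarrow$ | $(\mu CUM)$
(7) | $(\mu\subseteq) + (\mu\subseteq\supseteq)$ | $\Rightarrow$ | $(\mu CUM)$
(8) | $(\mu\subseteq) + (\mu CUM) + (\cap)$ | $\Rightarrow$ | $(\mu\subseteq\supseteq)$
(9) | $(\mu\subseteq) + (\mu CUM)$ | $\not\Rightarrow$ | $(\mu\subseteq\supseteq)$

Rationality
(10) | $(\mu RatM) + (\mu PR)$ | $\Rightarrow$ | $(\mu=)$
(11) | $(\mu=)$ | $\Rightarrow$ | $(\mu PR) + (\mu RatM)$
(12.1) | $(\mu=)$ | $\Rightarrow$ $(\cap) + (\mu\subseteq)$ | $(\mu=')$
(12.2) | $(\mu=)$ | $\Leftarrow$ | $(\mu=')$
(13) | $(\mu\subseteq) + (\mu=)$ | $\Rightarrow$ | $(\mu\cup)$
(14) | $(\mu\subseteq) + (\mu\emptyset) + (\mu=)$ | $\Rightarrow$ $(\cup)$ | $(\mu\parallel)$, $(\mu\cup')$, $(\mu CUM)$
(15) | $(\mu\subseteq) + (\mu\parallel)$ | $\Rightarrow$ $(-)$ of $\mathcal{Y}$ | $(\mu=)$
(16) | $(\mu\parallel) + (\mu\in) + (\mu PR) + (\mu\subseteq)$ | $\Rightarrow$ $(\cup)$ + $\mathcal{Y}$ contains singletons | $(\mu=)$
(17) | $(\mu CUM) + (\mu=)$ | $\Rightarrow$ $(\cup)$ + $\mathcal{Y}$ contains singletons | $(\mu\in)$
(18) | $(\mu CUM) + (\mu=) + (\mu\subseteq)$ | $\Rightarrow$ $(\cup)$ | $(\mu\parallel)$
(19) | $(\mu PR) + (\mu CUM) + (\mu\parallel)$ | $\Rightarrow$ sufficient, e.g. true in | $(\mu=)$
(20) | $(\mu\subseteq) + (\mu PR) + (\mu=)$ | $\not\Rightarrow$ | $(\mu\parallel)$
(21) | $(\mu\subseteq) + (\mu PR) + (\mu\parallel)$ | $\not\Rightarrow$ (without $(-)$) | $(\mu=)$
(22) | $(\mu\subseteq) + (\mu PR) + (\mu\parallel) + (\mu=) + (\mu\cup)$ | $\not\Rightarrow$ | $(\mu\in)$ (thus not representability by ranked structures)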



Proof 

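All sets are to be in $\mathcal{Y}$.

(1.1) $(\mu PR) + (\cap) + (\mu\subseteq) \Rightarrow (\mu PR')$:

By $X \cap Y \subseteq X$ and $(\mu PR)$, $f(X) \cap X \cap Y \subseteq f(X \cap Y)$. By $(\mu\subseteq)$ $f(X) \cap Y = f(X) \cap X \cap Y$.

(1.2) $(\mu PR') \Rightarrow (\mu PR)$:

Let $X \subseteq Y$, so $X = X \cap Y$, so by $(\mu PR')$ $f(Y) \cap X \subseteq f(X \cap Y) = f(X)$.

(2.1) $(\mu PR) + (\mu\subseteq) \Rightarrow (\mu OR)$:

$f(X \cup Y) \subseteq X \cup Y$ by $(\mu\subseteq)$, so $f(X \cup Y) = (f(X \cup Y) \cap X) \cup (f(X \cup Y) \cap Y) \subseteq f(X) \cup f(Y)$.

(2.2) $(\mu OR) + (\mu\subseteq) + (-) \Rightarrow (\mu PR)$:

Let $X \subseteq Y$, $X' := Y - X$. $f(Y) \subseteq f(X) \cup f(X')$ by $(\mu OR)$, so $f(Y) \cap X \subseteq (f(X) \cap X) \cup (f(X') \cap X) =_{(\mu\subseteq)} f(X) \cup \emptyset =
f(X)$.

(2.3) $(\mu PR) + (\mu\subseteq) \Rightarrow (\mu wOR)$:
Trivial by (2.1).

(2.4) $(\mu wOR) + (\mu\subseteq) + (-) \Rightarrow (\mu PR)$:

Let $X \subseteq Y$, $X' := Y - X$. $f(Y) \subseteq f(X) \cup X'$ by $(\mu wOR)$, so $f(Y) \cap X \subseteq (f(X) \cap X) \cup (X' \cap X) =_{(\mu\subseteq)} f(X) \cup \emptyset = f(X)$.

(3) $(\mu PR) \Rightarrow (\mu CUT)$:

$f(X) \subseteq Y \subseteq X \Rightarrow f(X) \subseteq f(X) \cap Y \subseteq f(Y)$ by $(\mu PR)$.

(4) $(\mu\subseteq) + (\mu\subseteq\supseteq) + (\mu CUM) + (\mu RatM) + (\cap) \not\Rightarrow (\mu PR)$:
This is shown in Example 2.3.2.

(5.1) $(\mu CM) + (\cap) + (\mu\subseteq) \Rightarrow (\mu ResM)$:

Let $f(X) \subseteq A \cap B$, so $f(X) \subseteq A$, so by $(\mu\subseteq)$ $f(X) \subseteq A \cap X \subseteq X$, so by $(\mu CM)$ $f(A \cap X) \subseteq f(X) \subseteq B$.

(5.2) $(\mu ResM) \Rightarrow (\mu CM)$:

We consider here the infinitary version, where all sets can be model sets of infinite theories. Let $f(X) \subseteq Y \subseteq X$, so
$f(X) \subseteq Y \cap f(X)$, so by $(\mu ResM)$ $f(Y) = f(X \cap Y) \subseteq f(X)$.

(6) $(\mu CM) + (\mu CUT) \Leftrightarrow (\mu CUM)$:
Trivial.

(7) $(\mu\subseteq) + (\mu\subseteq\supseteq) \Rightarrow (\mu CUM)$: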
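(8) $(\mu\subseteq) + (\mu CUM) + (\cap) \Rightarrow (\mu\subseteq\supseteq)$:

Let $f(D) \subseteq E$, $f(E) \subseteq D$, so by $(\mu\subseteq)$ $f(D) \subseteq D \cap E \subseteq D$, $f(E) \subseteq D \cap E \subseteq E$. As $f(D \cap E)$ is defined, so $f(D) =
f(D \cap E) = f(E)$ by $(\mu CUM)$.

(9) $(\mu\subseteq) + (\mu CUM) \not\Rightarrow (\mu\subseteq\supseteq)$:

This is shown in Example 2.3.1.

(10) $(\mu RatM) + (\mu PR) \Rightarrow (\mu=)$:
Trivial.

(11) $(\mu=)$ entails $(\mu PR)$ and $(\mu RatM)$:
Trivial.

(12.1) $(\mu=) \Rightarrow (\mu=')$:

Let $f(Y) \cap X \neq \emptyset$, we have to show $f(X \cap Y) = f(Y) \cap X$. By $(\mu\subseteq)$ $f(Y) \subseteq Y$, so $f(Y) \cap X = f(Y) \cap (X \cap Y)$, so by
$(\mu=)$ $f(Y) \cap X = f(Y) \cap (X \cap Y) = f(X \cap Y)$.

(12.2) $(\mu=') \Rightarrow (\mu=)$:

Let $X \subseteq Y$, $f(Y) \cap X \neq \emptyset$, then $f(X) = f(Y \cap X) = f(Y) \cap X$.

(13) $(\mu\subseteq), (\mu=) \Rightarrow (\mu\cup)$:

If not, $f(X \cup Y) \cap Y \neq \emptyset$, but $f(Y) \cap (X - f(X)) \neq \emptyset$. By (11), $(\mu PR)$ holds, so $f(X \cup Y) \cap X \subseteq f(X)$, so $\emptyset \neq
f(Y) \cap (X - f(X)) \subseteq f(Y) \cap (X - f(X \cup Y))$, so $f(Y) - f(X \cup Y) \neq \emptyset$, so by $(\mu\subseteq)$ $f(Y) \subseteq Y$ and $f(Y) \neq f(X \cup Y) \cap Y$.
But by $(\mu=)$ $f(Y) = f(X \cup Y) \cap Y$, a contradiction.

(14)

$(\mu\subseteq), (\mu\emptyset), (\mu=) \Rightarrow (\mu\parallel)$:

If $X$ or $Y$ or both are empty, then this is trivial. Assume then $X \cup Y \neq \emptyset$, so by $(\mu\emptyset)$ $f(X \cup Y) \neq \emptyset$. By $(\mu\subseteq)$
$f(X \cup Y) \subseteq X \cup Y$, so $f(X \cup Y) \cap X = \emptyset$ and $f(X \cup Y) \cap Y = \emptyset$ together are impossible. Case 1, $f(X \cup Y) \cap X \neq \emptyset$ and
$f(X \cup Y) \cap Y \neq \emptyset$: By $(\mu=)$ $f(X \cup Y) \cap X = f(X)$ and $f(X \cup Y) \cap Y = f(Y)$, so by $(\mu\subseteq)$ $f(X \cup Y) = f(X) \cup f(Y)$. Case
2, $f(X \cup Y) \cap X \neq \emptyset$ and $f(X \cup Y) \cap Y = \emptyset$: So by $(\mu=)$ $f(X \cup Y) = f(X \cup Y) \cap X = f(X)$. Case 3, $f(X \cup Y) \cap X = \emptyset$
and $f(X \cup Y) \cap Y \neq \emptyset$: Symmetrical.

$(\mu\subseteq), (\mu\emptyset), (\mu=) \Rightarrow (\mu\cup')$:

Let $f(Y) \cap (X - f(X)) \neq \emptyset$. If $X \cup Y = \emptyset$, then $f(X \cup Y) = f(X) = \emptyset$ by $(\mu\subseteq)$. So suppose $X \cup Y \neq \emptyset$. By (13),
$f(X \cup Y) \cap Y = \emptyset$, so $f(X \cup Y) \subseteq X$ by $(\mu\subseteq)$. By $(\mu\emptyset)$, $f(X \cup Y) \neq \emptyset$, so $f(X \cup Y) \cap X \neq \emptyset$, and $f(X \cup Y) = f(X)$ by
$(\mu=)$.

$(\mu\subseteq), (\mu\emptyset), (\mu=) \Rightarrow (\mu CUM)$:

Let $f(Y) \subseteq X \subseteq Y$. If $Y = \emptyset$, this is trivial by $(\mu\subseteq)$. If $Y \neq \emptyset$, then by $(\mu\emptyset)$ - which is crucial here - $f(Y) \neq \emptyset$, so by
$f(Y) \subseteq X$ $f(Y) \cap X \neq \emptyset$, so by $(\mu=)$ $f(Y) = f(Y) \cap X = f(X)$.

(15) $(\mu\subseteq) + (\mu\parallel) \Rightarrow (\mu=)$:

Let $X \subseteq Y$, $X \cap f(Y) \neq \emptyset$, and consider $Y = X \cup (Y - X)$. Then $f(Y) = f(X) \parallel f(Y - X)$. As $f(Y) \cap X \neq \emptyset$, $f(Y) = f(Y - X)$
is impossible. Otherwise, $f(X) = f(Y) \cap X$, and we are done.

(16) $(\mu\parallel) + (\mu\in) + (\mu PR) + (\mu\subseteq) \Rightarrow (\mu=)$:

Suppose $X \subseteq Y$, $x \in f(Y) \cap X$, we have to show $f(Y) \cap X = f(X)$. "$\subseteq$" is trivial by $(\mu PR)$. "$\supseteq$": Assume $a \notin f(Y)$
(by $(\mu\subseteq)$), but $a \in f(X)$. By $(\mu\in)$ $\exists b \in Y.\ a \notin f(\{a, b\})$. As $a \in f(X)$, by $(\mu PR)$, $a \in f(\{a, x\})$. By $(\mu\parallel)$, $f(\{a, b, x\}) =
f(\{a, x\}) \parallel f(\{b\})$. As $a \notin f(\{a, b, x\})$, $f(\{a, b, x\}) = f(\{b\})$, so $x \notin f(\{a, b, x\})$, contradicting $(\mu PR)$, as $a, b, x \in Y$.

(17) $(\mu CUM) + (\mu=) \Rightarrow (\mu\in)$:

Let $a \in X - f(X)$. If $f(X) = \emptyset$, then $f(\{a\}) = \emptyset$ by $(\mu CUM)$. If not: Let $b \in f(X)$, then $a \notin f(\{a, b\})$ by $(\mu=)$.

(18) $(\mu CUM) + (\mu=) + (\mu\subseteq) \Rightarrow (\mu\parallel)$:

By $(\mu CUM)$, $f(X \cup Y) \subseteq X \subseteq X \cup Y \Rightarrow f(X) = f(X \cup Y)$, and $f(X \cup Y) \subseteq Y \subseteq X \cup Y \Rightarrow f(Y) = f(X \cup Y)$. Thus, if
$(\mu\parallel)$ were to fail, $f(X \cup Y) \not\subseteq X$, $f(X \cup Y) \not\subseteq Y$, but then by $(\mu\subseteq)$ $f(X \cup Y) \cap X \neq \emptyset$, so $f(X) = f(X \cup Y) \cap X$, and
$f(X \cup Y) \cap Y \neq \emptyset$, so $f(Y) = f(X \cup Y) \cap Y$ by $(\mu=)$. Thus, $f(X \cup Y) = (f(X \cup Y) \cap X) \cup (f(X \cup Y) \cap Y) = f(X) \cup f(Y)$.

(19) $(\mu PR) + (\mu CUM) + (\mu\parallel) \Rightarrow (\mu=)$:

Suppose $(\mu=)$ does not hold. So, by $(\mu PR)$, there are $X, Y, y$ s.t. $X \subseteq Y$, $X \cap f(Y) \neq \emptyset$, $y \in Y - f(Y)$, $y \in f(X)$. Let
$a \in X \cap f(Y)$. If $f(Y) = \{a\}$, then by $(\mu CUM)$ $f(Y) = f(X)$, so there must be $b \in f(Y)$, $b \neq a$. Take now $Y', Y''$ s.t.
$Y = Y' \cup Y''$, $a \in Y'$, $a \notin Y''$, $b \in Y''$, $b \notin Y'$, $y \in Y' \cap Y''$. Assume now $(\mu\parallel)$ to hold, we show a contradiction. If
$y \notin f(Y'')$, then by $(\mu PR)$ $y \notin f(Y'' \cup \{a\})$. But $f(Y'' \cup \{a\}) = f(Y'') \parallel f(\{a, y\})$, so $f(Y'' \cup \{a\}) = f(Y'')$, contradicting
$a \in f(Y)$. If $y \in f(Y'')$, then by $f(Y) = f(Y') \parallel f(Y'')$, $f(Y) = f(Y')$, contradiction as $b \notin f(Y')$.

(20) $(\mu\subseteq) + (\mu PR) + (\mu=) \not\Rightarrow (\mu\parallel)$:
See Example 2.3.3.

(21) $(\mu\subseteq) + (\mu PR) + (\mu\parallel) \not\Rightarrow (\mu=)$:
See Example 2.3.4.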



(22) $(\mu\subseteq) + (\mu PR) + (\mu\parallel) + (\mu=) + (\mu\cup) \not\Rightarrow (\mu\in)$:
See Example 2.3.5.

Thus, by Fact 4.2.7, the conditions do not assure representability by ranked structures.
□ 



Remark 2.3.2 

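Note that $(\mu=')$ is very close to $(RatM)$: $(RatM)$ says: $\alpha \mathrel{|\!\sim} \beta$, $\alpha \not\mathrel{|\!\sim} \neg\gamma$ $\Rightarrow$ $\alpha \wedge \gamma \mathrel{|\!\sim} \beta$. Or, $f(A) \subseteq B$, $f(A) \cap C \neq \emptyset$ $\Rightarrow$
$f(A \cap C) \subseteq B$ for all $A, B, C$. This is not quite, but almost: $f(A \cap C) \subseteq f(A) \cap C$ (it depends how many $B$ there are, if
$f(A)$ is some such $B$, the fit is perfect).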

Example 2.3.1 

We show here $(\mu\subseteq) + (\mu CUM) \not\Rightarrow (\mu\subseteq\supseteq)$.

Consider $X := \{a, b, c\}$, $Y := \{a, b, d\}$, $f(X) := \{a\}$, $f(Y) := \{a, b\}$, $\mathcal{Y} := \{X, Y\}$. (If $f(\{a, b\})$ were defined, we would
have $f(X) = f(\{a, b\}) = f(Y)$, contradiction.)

Obviously, $(\mu\subseteq)$ and $(\mu CUM)$ hold, but not $(\mu\subseteq\supseteq)$.
□ 
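
As a sanity check, the three conditions of Example 2.3.1 can be tested by brute force over the two-element domain. The sketch assumes the formulations of the conditions used in the proofs above: $(\mu\subseteq)$ as $f(S) \subseteq S$, $(\mu CUM)$ as $f(S) \subseteq T \subseteq S \Rightarrow f(S) = f(T)$, and $(\mu\subseteq\supseteq)$ as $f(S) \subseteq T, f(T) \subseteq S \Rightarrow f(S) = f(T)$. Variable names are illustrative.

```python
X, Y = frozenset("abc"), frozenset("abd")
f = {X: frozenset("a"), Y: frozenset("ab")}  # f is defined on X and Y only
dom = list(f)

mu_sub = all(f[S] <= S for S in dom)
mu_cum = all(f[S] == f[T] for S in dom for T in dom if f[S] <= T <= S)
mu_subsup = all(f[S] == f[T] for S in dom for T in dom
                if f[S] <= T and f[T] <= S)

print(mu_sub, mu_cum, mu_subsup)
```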



Example 2.3.2 

We show here $(\mu\subseteq) + (\mu\subseteq\supseteq) + (\mu CUM) + (\mu RatM) + (\cap) \not\Rightarrow (\mu PR)$.

Let $U := \{a, b, c\}$. Let $\mathcal{Y} = \mathcal{P}(U)$. So $(\cap)$ is trivially satisfied. Set $f(X) := X$ for all $X \subseteq U$ except for $f(\{a, b\}) = \{b\}$.
Obviously, this cannot be represented by a preferential structure and $(\mu PR)$ is false for $U$ and $\{a, b\}$. But it satisfies $(\mu\subseteq)$,
$(\mu CUM)$, $(\mu RatM)$. $(\mu\subseteq)$ is trivial. $(\mu CUM)$: Let $f(X) \subseteq Y \subseteq X$. If $f(X) = X$, we are done. Consider $f(\{a, b\}) = \{b\}$.
If $\{b\} \subseteq Y \subseteq \{a, b\}$, then $f(Y) = \{b\}$, so we are done again. It is shown in Fact 2.3.1, (8), that $(\mu\subseteq\supseteq)$ follows.
$(\mu RatM)$: Suppose $X \subseteq Y$, $X \cap f(Y) \neq \emptyset$, we have to show $f(X) \subseteq f(Y) \cap X$. If $f(Y) = Y$, the result holds by $X \subseteq Y$, so
it does if $X = Y$. The only remaining case is $Y = \{a, b\}$, $X = \{b\}$, and the result holds again.
□ 
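
The claims of Example 2.3.2 can likewise be verified exhaustively over the eight subsets of U. The sketch below assumes the set-theoretic readings of the conditions used in the proofs above; the dictionary `checks` and its keys are illustrative names.

```python
from itertools import combinations

U = "abc"
dom = [frozenset(c) for r in range(len(U) + 1) for c in combinations(U, r)]

# f is the identity except on {a, b}.
f = {S: S for S in dom}
f[frozenset("ab")] = frozenset("b")

pairs = [(S, T) for S in dom for T in dom]
checks = {
    # f(S) is a subset of S
    "mu_sub": all(f[S] <= S for S in dom),
    # S <= T implies f(T) & S <= f(S)
    "mu_PR": all(f[T] & S <= f[S] for S, T in pairs if S <= T),
    # f(S) <= T <= S implies f(S) = f(T)
    "mu_CUM": all(f[S] == f[T] for S, T in pairs if f[S] <= T <= S),
    # f(S) <= T and f(T) <= S imply f(S) = f(T)
    "mu_subsup": all(f[S] == f[T] for S, T in pairs
                     if f[S] <= T and f[T] <= S),
    # S <= T and S & f(T) nonempty imply f(S) <= f(T) & S
    "mu_RatM": all(f[S] <= f[T] & S for S, T in pairs
                   if S <= T and S & f[T]),
}
print(checks)
```

Only the entry for $(\mu PR)$ comes out false, witnessed by $U$ and $\{a, b\}$ as stated in the example.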



Example 2.3.3 

The example shows that $(\mu\subseteq) + (\mu PR) + (\mu=) \not\Rightarrow (\mu\parallel)$.

Consider the following structure without transitivity: $U := \{a, b, c, d\}$, $c$ and $d$ have $\omega$ many copies in descending order
$c_1 \succeq c_2 \ldots$, etc. $a, b$ have one single copy each, $a \succeq b$, $a \succeq d_1$, $b \succeq a$, $b \succeq c_1$. $(\mu\parallel)$ does not hold: $f(U) = \emptyset$, but
$f(\{a, c\}) = \{a\}$, $f(\{b, d\}) = \{b\}$. $(\mu PR)$ holds as in all preferential structures. $(\mu=)$ holds: If it were to fail, then for some
$A \subseteq B$, $f(B) \cap A \neq \emptyset$, so $f(B) \neq \emptyset$. But the only possible cases for $B$ are now: ($a \in B$, $b, d \notin B$) or ($b \in B$, $a, c \notin B$). Thus,
$B$ can be $\{a\}$, $\{a, c\}$, $\{b\}$, $\{b, d\}$ with $f(B) = \{a\}$, $\{a\}$, $\{b\}$, $\{b\}$. If $A = B$, then the result will hold trivially. Moreover, $A$
has to be $\neq \emptyset$. So the remaining cases of $B$ where it might fail are $B = \{a, c\}$ and $\{b, d\}$, and by $f(B) \cap A \neq \emptyset$, the only
cases of $A$ where it might fail, are $A = \{a\}$ or $\{b\}$ respectively. So the only cases remaining are: $B = \{a, c\}$, $A = \{a\}$ and
$B = \{b, d\}$, $A = \{b\}$. In the first case, $f(A) = f(B) = \{a\}$, in the second $f(A) = f(B) = \{b\}$, so $(\mu=)$ holds in both.

□ 



Example 2.3.4 

The example shows that $(\mu\subseteq) + (\mu PR) + (\mu\parallel) \not\Rightarrow (\mu=)$.

Work in the set of theory definable model sets of an infinite propositional language. Note that this is not closed under
set difference, and closure properties will play a crucial role in the argumentation. Let $U := \{y, a, x_{i<\omega}\}$, where $x_i \to a$
in the standard topology. For the order, arrange s.t. $y$ is minimized by any set iff this set contains a cofinal subsequence
of the $x_i$, this can be done by the standard construction. Moreover, let the $x_i$ all kill themselves, i.e. with $\omega$ many copies
$x_i^1 \succeq x_i^2 \succeq \ldots$. There are no other elements in the relation. Note that if $a \notin \mu(X)$, then $a \notin X$, and $X$ cannot contain
a cofinal subsequence of the $x_i$, as $X$ is closed in the standard topology. (A short argument: suppose $X$ contains such
a subsequence, but $a \notin X$. Then the theory of $a$, $Th(a)$, is inconsistent with $Th(X)$, so already a finite subset of $Th(a)$
is inconsistent with $Th(X)$, but such a finite subset will finally hold in a cofinal sequence converging to $a$.) Likewise, if
$y \in \mu(X)$, then $X$ cannot contain a cofinal subsequence of the $x_i$.

Obviously, $(\mu\subseteq)$ and $(\mu PR)$ hold, but $(\mu=)$ does not hold: Set $B := U$, $A := \{a, y\}$. Then $\mu(B) = \{a\}$, $\mu(A) = \{a, y\}$,
contradicting $(\mu=)$.

It remains to show that $(\mu\parallel)$ holds.

$\mu(X)$ can only be $\emptyset$, $\{a\}$, $\{y\}$, $\{a, y\}$. As $\mu(A \cup B) \subseteq \mu(A) \cup \mu(B)$ by $(\mu PR)$,
Case 1, $\mu(A \cup B) = \{a, y\}$ is settled.

Case 2: $\mu(A \cup B) = \{a\}$.

Case 2.1: $\mu(A) = \{a\}$ - we are done.

Case 2.2: $\mu(A) = \{y\}$: $A$ does not contain $a$, nor a cofinal subsequence. If $\mu(B) = \emptyset$, then $a \notin B$, so $a \notin A \cup B$, a
contradiction. If $\mu(B) = \{a\}$, we are done. If $y \in \mu(B)$, then $y \in B$, but $B$ does not contain a cofinal subsequence, so
$A \cup B$ does not either, so $y \in \mu(A \cup B)$, contradiction.

Case 2.3: $\mu(A) = \emptyset$: $A$ cannot contain a cofinal subsequence. If $\mu(B) = \{a\}$, we are done. $a \in \mu(B)$ does have to hold,
so $\mu(B) = \{a, y\}$ is the only remaining possibility. But then $B$ does not contain a cofinal subsequence, and neither does
$A \cup B$, so $y \in \mu(A \cup B)$, contradiction.

Case 2.4: $\mu(A) = \{a, y\}$: $A$ does not contain a cofinal subsequence. If $\mu(B) = \{a\}$, we are done. If $\mu(B) = \emptyset$, $B$ does not
contain a cofinal subsequence (as $a \notin B$), so neither does $A \cup B$, so $y \in \mu(A \cup B)$, contradiction. If $y \in \mu(B)$, $B$ does not
contain a cofinal subsequence, and we are done again.

Case 3: $\mu(A \cup B) = \{y\}$: To obtain a contradiction, we need $a \in \mu(A)$ or $a \in \mu(B)$. But in both cases $a \in \mu(A \cup B)$.

Case 4: $\mu(A \cup B) = \emptyset$: Thus, $A \cup B$ contains no cofinal subsequence. If, e.g. $y \in \mu(A)$, then $y \in \mu(A \cup B)$, if $a \in \mu(A)$,
then $a \in \mu(A \cup B)$, so $\mu(A) = \emptyset$.

□ 



Example 2.3.5 

The example shows that $(\mu \subseteq) + (\mu PR) + (\mu \parallel) + (\mu =) + (\mu \cup) \not\Rightarrow (\mu \in)$.

Let $U := \{y, x_{i<\omega}\}$, $x_i$ a sequence, each $x_i$ kills itself, $x_i^1 \succ x_i^2 \succ \ldots$ and $y$ is killed by all cofinal subsequences of the $x_i$.
Then for any $X \subseteq U$ $\mu(X) = \emptyset$ or $\mu(X) = \{y\}$.

$(\mu \subseteq)$ and $(\mu PR)$ hold obviously.

$(\mu \parallel)$: Let $A \cup B$ be given. If $y \notin X$, then for all $Y \subseteq X$ $\mu(Y) = \emptyset$. So, if $y \notin A \cup B$, we are done. If $y \in A \cup B$, if
$\mu(A \cup B) = \emptyset$, one of $A$, $B$ must contain a cofinal sequence, it will have $\mu = \emptyset$. If not, then $\mu(A \cup B) = \{y\}$, and this will
also hold for the one $y$ is in.

$(\mu =)$: Let $A \subseteq B$, $\mu(B) \cap A \neq \emptyset$, show $\mu(A) = \mu(B) \cap A$. But now $\mu(B) = \{y\}$, $y \in A$, so $B$ does not contain a cofinal
subsequence, neither does $A$, so $\mu(A) = \{y\}$.

$(\mu \cup)$: $(A - \mu(A)) \cap \mu(A') \neq \emptyset$, so $\mu(A') = \{y\}$, so $\mu(A \cup A') = \emptyset$, as $y \in A - \mu(A)$.

But $(\mu \in)$ does not hold: $y \in U - \mu(U)$, but there is no $x$ s.t. $y \notin \mu(\{x, y\})$.

□ 



We turn to interdependencies of the different $\mu$-conditions. Again, we will sometimes use preferential structures in our
arguments.

Fact 2.3.3 

$(\mu wOR) + (\mu \subseteq) \Rightarrow f(X \cup Y) \subseteq f(X) \cup f(Y) \cup (X \cap Y)$



Proof 

$f(X \cup Y) \subseteq f(X) \cup Y$, $f(X \cup Y) \subseteq X \cup f(Y)$, so $f(X \cup Y) \subseteq (f(X) \cup Y) \cap (X \cup f(Y)) = f(X) \cup f(Y) \cup (X \cap Y)$ □
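
Fact 2.3.3 can be sanity-checked by brute force on a small universe. The following is a minimal sketch, not part of the original text: it enumerates every function f on the subsets of a three-element set with f(X) ⊆ X, keeps those satisfying (μwOR), and looks for a violation of the conclusion. The helper names (`subsets`, `choice_functions`, `has_mu_wor`, `has_conclusion`, `counterexamples`) are hypothetical, and a finite check of course only illustrates the fact, it does not prove it.

```python
from itertools import combinations, product

def subsets(universe):
    """All subsets of a finite set, as frozensets."""
    items = sorted(universe)
    return [frozenset(c) for r in range(len(items) + 1)
            for c in combinations(items, r)]

def choice_functions(universe):
    """Every f on P(universe) with f(X) a subset of X, i.e. (mu ⊆)."""
    domain = subsets(universe)
    for values in product(*(subsets(X) for X in domain)):
        yield dict(zip(domain, values))

def has_mu_wor(f):
    """(mu wOR): f(X ∪ Y) ⊆ f(X) ∪ Y for all X, Y."""
    return all(f[X | Y] <= f[X] | Y for X in f for Y in f)

def has_conclusion(f):
    """Conclusion of the fact: f(X ∪ Y) ⊆ f(X) ∪ f(Y) ∪ (X ∩ Y)."""
    return all(f[X | Y] <= f[X] | f[Y] | (X & Y) for X in f for Y in f)

def counterexamples(universe):
    """Functions satisfying the premises but violating the conclusion."""
    return [f for f in choice_functions(universe)
            if has_mu_wor(f) and not has_conclusion(f)]
```

Calling `counterexamples({0, 1, 2})` inspects all 4096 candidate functions; by the fact, the returned list should be empty.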



Proposition 2.3.4 

The following table is to be read as follows: 

Let a logic $|\!\sim$ satisfy (LLE) and (CCL), and define a function $f : D_{\mathcal{L}} \to D_{\mathcal{L}}$ by $f(M(T)) := M(\overline{\overline{T}})$. Then $f$ is well
defined, satisfies $(\mu dp)$, and $\overline{\overline{T}} = Th(f(M(T)))$.

If $|\!\sim$ satisfies a rule in the left hand side, then - provided the additional properties noted in the middle for $\Rightarrow$ hold, too -
$f$ will satisfy the property in the right hand side.

Conversely, if $f : \mathcal{Y} \to \mathcal{P}(M_{\mathcal{L}})$ is a function, with $D_{\mathcal{L}} \subseteq \mathcal{Y}$, and we define a logic $|\!\sim$ by $\overline{\overline{T}} := Th(f(M(T)))$, then $|\!\sim$ satisfies
(LLE) and (CCL). If $f$ satisfies $(\mu dp)$, then $f(M(T)) = M(\overline{\overline{T}})$.

If $f$ satisfies a property in the right hand side, then - provided the additional properties noted in the middle for $\Leftarrow$ hold,
too - $|\!\sim$ will satisfy the property in the left hand side.
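Basics
(1.1)   (OR)          $\Rightarrow$                                      $(\mu OR)$
(1.2)                 $\Leftarrow$
(2.1)   (disjOR)      $\Rightarrow$                                      $(\mu disjOR)$
(2.2)                 $\Leftarrow$
(3.1)   (wOR)         $\Rightarrow$                                      $(\mu wOR)$
(3.2)                 $\Leftarrow$
(4.1)   (SC)          $\Rightarrow$                                      $(\mu \subseteq)$
(4.2)                 $\Leftarrow$
(5.1)   (CP)          $\Rightarrow$                                      $(\mu \emptyset)$
(5.2)                 $\Leftarrow$
(6.1)   (PR)          $\Rightarrow$                                      $(\mu PR)$
(6.2)                 $\Leftarrow$ $(\mu dp) + (\mu \subseteq)$
(6.3)                 $\not\Leftarrow$ without $(\mu dp)$
(6.4)                 $\Leftarrow$ $(\mu \subseteq)$, $T'$ a formula
(6.5)   (PR)          $\Leftarrow$ $T'$ a formula                        $(\mu PR')$
(7.1)   (CUT)         $\Rightarrow$                                      $(\mu CUT)$
(7.2)                 $\Leftarrow$

Cumulativity
(8.1)   (CM)          $\Rightarrow$                                      $(\mu CM)$
(8.2)                 $\Leftarrow$
(9.1)   (ResM)        $\Rightarrow$                                      $(\mu ResM)$
(9.2)                 $\Leftarrow$
(10.1)  $(\subseteq\supseteq)$   $\Rightarrow$                           $(\mu \subseteq\supseteq)$
(10.2)                $\Leftarrow$
(11.1)  (CUM)         $\Rightarrow$                                      $(\mu CUM)$
(11.2)                $\Leftarrow$

Rationality
(12.1)  (RatM)        $\Rightarrow$                                      $(\mu RatM)$
(12.2)                $\Leftarrow$ $(\mu dp)$
(12.3)                $\not\Leftarrow$ without $(\mu dp)$
(12.4)                $\Leftarrow$ $T$ a formula
(13.1)  (RatM =)      $\Rightarrow$                                      $(\mu =)$
(13.2)                $\Leftarrow$ $(\mu dp)$
(13.3)                $\not\Leftarrow$ without $(\mu dp)$
(13.4)                $\Leftarrow$ $T$ a formula
(14.1)  (Log =')      $\Rightarrow$                                      $(\mu =')$
(14.2)                $\Leftarrow$ $(\mu dp)$
(14.3)                $\not\Leftarrow$ without $(\mu dp)$
(14.4)                $\Leftarrow$ $T$ a formula
(15.1)  (Log $\parallel$)   $\Rightarrow$                                $(\mu \parallel)$
(15.2)                $\Leftarrow$
(16.1)  (Log $\cup$)  $\Rightarrow$ $(\mu \subseteq) + (\mu =)$          $(\mu \cup)$
(16.2)                $\Leftarrow$ $(\mu dp)$
(16.3)                $\not\Leftarrow$ without $(\mu dp)$
(17.1)  (Log $\cup'$) $\Rightarrow$ $(\mu \subseteq) + (\mu =)$          $(\mu \cup')$
(17.2)                $\Leftarrow$ $(\mu dp)$
(17.3)                $\not\Leftarrow$ without $(\mu dp)$
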

Proof 

Set $f(T) := f(M(T))$, note that $f(T \cup T') := f(M(T \cup T')) = f(M(T) \cap M(T'))$.
We show first the general framework.

Let $|\!\sim$ satisfy (LLE) and (CCL). Let $f : D_{\mathcal{L}} \to D_{\mathcal{L}}$ be defined by $f(M(T)) := M(\overline{\overline{T}})$. If $M(T) = M(T')$, then $\overline{T} = \overline{T'}$,
so by (LLE) $\overline{\overline{T}} = \overline{\overline{T'}}$, so $f(M(T)) = f(M(T'))$, so $f$ is well defined and satisfies $(\mu dp)$. By (CCL) $Th(M(\overline{\overline{T}})) = \overline{\overline{T}}$.

Let $f$ be given, and $|\!\sim$ be defined by $\overline{\overline{T}} := Th(f(M(T)))$. Obviously, $|\!\sim$ satisfies (LLE) and (CCL) (and thus (RW)). If $f$
satisfies $(\mu dp)$, then $f(M(T)) = M(T')$ for some $T'$, and $f(M(T)) = M(Th(f(M(T)))) = M(\overline{\overline{T}})$ by Fact 2.2.3 (page 29).
(We will use Fact 2.2.3 (page 29) now without further mentioning.)

Next we show the following fact:

(a) If $f$ satisfies $(\mu dp)$, or $T'$ is equivalent to a formula, then $Th(f(T) \cap M(T')) = \overline{\overline{\overline{T}} \cup T'}$.

Case 1, $f$ satisfies $(\mu dp)$. $Th(f(M(T)) \cap M(T')) = Th(M(\overline{\overline{T}}) \cap M(T')) = \overline{\overline{\overline{T}} \cup T'}$ by Fact (page USD (5).

Case 2, $T'$ is equivalent to $\phi'$. $Th(f(M(T)) \cap M(\phi')) = \overline{Th(f(M(T))) \cup \{\phi'\}} = \overline{\overline{\overline{T}} \cup \{\phi'\}}$ by Fact 2.2.3 (page 29) (3).
We now prove the individual properties. 
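(1.1) (OR) $\Rightarrow$ $(\mu OR)$

Let $X = M(T)$, $Y = M(T')$. $f(X \cup Y) = f(M(T) \cup M(T')) = f(M(T \vee T')) := M(\overline{\overline{T \vee T'}}) \subseteq_{(OR)} M(\overline{\overline{T}} \cap \overline{\overline{T'}}) =_{(CCL)}
M(\overline{\overline{T}}) \cup M(\overline{\overline{T'}}) =: f(X) \cup f(Y)$.

(1.2) $(\mu OR)$ $\Rightarrow$ (OR)

$\overline{\overline{T \vee T'}} := Th(f(M(T \vee T'))) = Th(f(M(T) \cup M(T'))) \supseteq_{(\mu OR)} Th(f(M(T)) \cup f(M(T')))$ = (by Fact EH (page Q$ )
$Th(f(M(T))) \cap Th(f(M(T'))) =: \overline{\overline{T}} \cap \overline{\overline{T'}}$.

(2) By $\neg Con(T, T') \Leftrightarrow M(T) \cap M(T') = \emptyset$, we can use directly the proofs for 1.

(3.1) (wOR) $\Rightarrow$ $(\mu wOR)$

Let $X = M(T)$, $Y = M(T')$. $f(X \cup Y) = f(M(T) \cup M(T')) = f(M(T \vee T')) := M(\overline{\overline{T \vee T'}}) \subseteq_{(wOR)} M(\overline{\overline{T}} \cap \overline{T'}) =_{(CCL)}
M(\overline{\overline{T}}) \cup M(T') =: f(X) \cup Y$.

(3.2) $(\mu wOR)$ $\Rightarrow$ (wOR)

$\overline{\overline{T \vee T'}} := Th(f(M(T \vee T'))) = Th(f(M(T) \cup M(T'))) \supseteq_{(\mu wOR)} Th(f(M(T)) \cup M(T'))$ = (by Fact [MJ (page [MJ )
$Th(f(M(T))) \cap Th(M(T')) =: \overline{\overline{T}} \cap \overline{T'}$.

(4.1) (SC) $\Rightarrow$ $(\mu \subseteq)$
Trivial.

(4.2) $(\mu \subseteq)$ $\Rightarrow$ (SC)
Trivial.

(5.1) (CP) $\Rightarrow$ $(\mu \emptyset)$
Trivial.

(5.2) $(\mu \emptyset)$ $\Rightarrow$ (CP)
Trivial.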
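(6.1) (PR) $\Rightarrow$ $(\mu PR)$:

Suppose $X := M(T)$, $Y := M(T')$, $X \subseteq Y$, we have to show $f(Y) \cap X \subseteq f(X)$. By prerequisite, $\overline{T'} \subseteq \overline{T}$, so $\overline{T \cup T'} = \overline{T}$, so
$\overline{\overline{T \cup T'}} = \overline{\overline{T}}$ by (LLE). By (PR) $\overline{\overline{T \cup T'}} \subseteq \overline{\overline{\overline{T'}} \cup T}$, so $f(Y) \cap X = f(T') \cap M(T) = M(\overline{\overline{T'}} \cup T) \subseteq M(\overline{\overline{T \cup T'}}) = M(\overline{\overline{T}}) = f(X)$.

(6.2) $(\mu PR) + (\mu dp) + (\mu \subseteq)$ $\Rightarrow$ (PR):

$f(T) \cap M(T') =_{(\mu \subseteq)} f(T) \cap M(T) \cap M(T') = f(T) \cap M(T \cup T') \subseteq_{(\mu PR)} f(T \cup T')$, so $\overline{\overline{T \cup T'}} = Th(f(T \cup T')) \subseteq
Th(f(T) \cap M(T')) = \overline{\overline{\overline{T}} \cup T'}$ by (a) above and $(\mu dp)$.

(6.3) $(\mu PR)$ $\not\Rightarrow$ (PR) without $(\mu dp)$:

$(\mu PR)$ holds in all preferential structures (see Definition 4.1.1 (page 57)) by Fact I4.2.T1 (page [61]). Example 4.2.1 (page
62) shows that (PR) may fail in the resulting logic.

(6.4) $(\mu PR) + (\mu \subseteq)$ $\Rightarrow$ (PR) if $T'$ is classically equivalent to a formula:

It was shown in the proof of (6.2) that $f(T) \cap M(\phi') \subseteq f(T \cup \{\phi'\})$, so $\overline{\overline{T \cup \{\phi'\}}} = Th(f(T \cup \{\phi'\})) \subseteq Th(f(T) \cap M(\phi')) =
\overline{\overline{\overline{T}} \cup \{\phi'\}}$ by (a) above.

(6.5) $(\mu PR')$ $\Rightarrow$ (PR), if $T'$ is classically equivalent to a formula:

$f(M(T)) \cap M(\phi') \subseteq_{(\mu PR')} f(M(T) \cap M(\phi')) = f(M(T \cup \{\phi'\}))$. So again $\overline{\overline{T \cup \{\phi'\}}} = Th(f(T \cup \{\phi'\})) \subseteq Th(f(T) \cap M(\phi')) =
\overline{\overline{\overline{T}} \cup \{\phi'\}}$ by (a) above.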

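(7.1) (CUT) $\Rightarrow$ $(\mu CUT)$

So let $X = M(T)$, $Y = M(T')$, and $f(T) := M(\overline{\overline{T}}) \subseteq M(T') \subseteq M(T)$ $\Rightarrow$ $\overline{T} \subseteq \overline{T'} \subseteq \overline{\overline{T}} =_{(LLE)} \overline{\overline{(\overline{T})}}$ $\Rightarrow$ (by (CUT))
$\overline{\overline{T}} = \overline{\overline{(\overline{T})}} \supseteq \overline{\overline{(\overline{T'})}} = \overline{\overline{T'}}$ $\Rightarrow$ $f(T) = M(\overline{\overline{T}}) \subseteq M(\overline{\overline{T'}}) = f(T')$, thus $f(X) \subseteq f(Y)$.

(7.2) $(\mu CUT)$ $\Rightarrow$ (CUT)

Let $T \subseteq T' \subseteq \overline{\overline{T}}$. Thus $f(T) \subseteq M(\overline{\overline{T}}) \subseteq M(T') \subseteq M(T)$, so by $(\mu CUT)$ $f(T) \subseteq f(T')$, so $\overline{\overline{T}} = Th(f(T)) \supseteq Th(f(T')) =
\overline{\overline{T'}}$.

(8.1) (CM) $\Rightarrow$ $(\mu CM)$

So let $X = M(T)$, $Y = M(T')$, and $f(T) := M(\overline{\overline{T}}) \subseteq M(T') \subseteq M(T)$ $\Rightarrow$ $\overline{T} \subseteq \overline{T'} \subseteq \overline{\overline{T}} =_{(LLE)} \overline{\overline{(\overline{T})}}$ $\Rightarrow$ (by (LLE), (CM))
$\overline{\overline{T}} = \overline{\overline{(\overline{T})}} \subseteq \overline{\overline{(\overline{T'})}} = \overline{\overline{T'}}$ $\Rightarrow$ $f(T) = M(\overline{\overline{T}}) \supseteq M(\overline{\overline{T'}}) = f(T')$, thus $f(X) \supseteq f(Y)$.

(8.2) $(\mu CM)$ $\Rightarrow$ (CM)

Let $T \subseteq T' \subseteq \overline{\overline{T}}$. Thus $f(T) \subseteq M(\overline{\overline{T}}) \subseteq M(T') \subseteq M(T)$, so $f(T) \supseteq f(T')$ by $(\mu CM)$, so $\overline{\overline{T}} = Th(f(T)) \subseteq
Th(f(T')) = \overline{\overline{T'}}$.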

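(9.1) (ResM) $\Rightarrow$ $(\mu ResM)$

Let $f(X) := M(\overline{\overline{\Delta}})$, $A := M(\alpha)$, $B := M(\beta)$. So $f(X) \subseteq A \cap B$ $\Rightarrow$ $\Delta |\!\sim \alpha, \beta$ $\Rightarrow_{(ResM)}$ $\Delta, \alpha |\!\sim \beta$ $\Rightarrow$ $M(\overline{\overline{\Delta, \alpha}}) \subseteq M(\beta)$ $\Rightarrow$
$f(X \cap A) \subseteq B$.

(9.2) $(\mu ResM)$ $\Rightarrow$ (ResM)

Let $f(X) := M(\overline{\overline{\Delta}})$, $A := M(\alpha)$, $B := M(\beta)$. So $\Delta |\!\sim \alpha, \beta$ $\Rightarrow$ $f(X) \subseteq A \cap B$ $\Rightarrow_{(\mu ResM)}$ $f(X \cap A) \subseteq B$ $\Rightarrow$ $\Delta, \alpha |\!\sim \beta$.

(10.1) $(\subseteq\supseteq)$ $\Rightarrow$ $(\mu \subseteq\supseteq)$

Let $f(T) \subseteq M(T')$, $f(T') \subseteq M(T)$. So $Th(M(T')) \subseteq Th(f(T))$, $Th(M(T)) \subseteq Th(f(T'))$, so $T' \subseteq \overline{T'} \subseteq \overline{\overline{T}}$, $T \subseteq \overline{T} \subseteq \overline{\overline{T'}}$, so
by $(\subseteq\supseteq)$ $\overline{\overline{T}} = \overline{\overline{T'}}$, so $f(T) := M(\overline{\overline{T}}) = M(\overline{\overline{T'}}) =: f(T')$.

(10.2) $(\mu \subseteq\supseteq)$ $\Rightarrow$ $(\subseteq\supseteq)$

Let $T \subseteq \overline{\overline{T'}}$ and $T' \subseteq \overline{\overline{T}}$. So by (CCL) $Th(M(T)) = \overline{T} \subseteq \overline{\overline{T'}} = Th(f(T'))$. But $Th(M(T)) \subseteq Th(X) \Rightarrow X \subseteq M(T)$:
$X \subseteq M(Th(X)) \subseteq M(Th(M(T))) = M(T)$. So $f(T') \subseteq M(T)$, likewise $f(T) \subseteq M(T')$, so by $(\mu \subseteq\supseteq)$ $f(T) = f(T')$, so
$\overline{\overline{T}} = \overline{\overline{T'}}$.

(11.1) (CUM) $\Rightarrow$ $(\mu CUM)$:

So let $X = M(T)$, $Y = M(T')$, and $f(T) := M(\overline{\overline{T}}) \subseteq M(T') \subseteq M(T)$ $\Rightarrow$ $\overline{T} \subseteq \overline{T'} \subseteq \overline{\overline{T}} =_{(LLE)} \overline{\overline{(\overline{T})}}$ $\Rightarrow$ $\overline{\overline{T}} = \overline{\overline{(\overline{T})}} = \overline{\overline{(\overline{T'})}} = \overline{\overline{T'}}$ $\Rightarrow$
$f(T) = M(\overline{\overline{T}}) = M(\overline{\overline{T'}}) = f(T')$, thus $f(X) = f(Y)$.

(11.2) $(\mu CUM)$ $\Rightarrow$ (CUM):

Let $T \subseteq T' \subseteq \overline{\overline{T}}$. Thus $f(T) \subseteq M(\overline{\overline{T}}) \subseteq M(T') \subseteq M(T)$, so $f(T) = f(T')$ by $(\mu CUM)$, so $\overline{\overline{T}} = Th(f(T)) = Th(f(T'))
= \overline{\overline{T'}}$.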

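(12.1) (RatM) $\Rightarrow$ $(\mu RatM)$

Let $X = M(T)$, $Y = M(T')$, and $X \subseteq Y$, $X \cap f(Y) \neq \emptyset$, so $T \vdash T'$ and $M(T) \cap f(M(T')) \neq \emptyset$, so $Con(T, \overline{\overline{T'}})$, so $\overline{\overline{\overline{T'}} \cup T} \subseteq \overline{\overline{T}}$
by (RatM), so $f(X) = f(M(T)) = M(\overline{\overline{T}}) \subseteq M(\overline{\overline{T'}} \cup T) = M(\overline{\overline{T'}}) \cap M(T) = f(Y) \cap X$.

(12.2) $(\mu RatM) + (\mu dp)$ $\Rightarrow$ (RatM):

Let $X = M(T)$, $Y = M(T')$, $T \vdash T'$, $Con(T, \overline{\overline{T'}})$, so $X \subseteq Y$ and by $(\mu dp)$ $X \cap f(Y) \neq \emptyset$, so by $(\mu RatM)$ $f(X) \subseteq f(Y) \cap X$,
so $\overline{\overline{T}} = \overline{\overline{T \cup T'}} = Th(f(T \cup T')) \supseteq Th(f(T') \cap M(T)) = \overline{\overline{\overline{T'}} \cup T}$ by (a) above and $(\mu dp)$.

(12.3) $(\mu RatM)$ $\not\Rightarrow$ (RatM) without $(\mu dp)$:

$(\mu RatM)$ holds in all ranked preferential structures (see Definition 4.1.4 (page 59)) by Fact 4.2.7 (page 65). Example
2.3.6 (page 40) (2) shows that (RatM) may fail in the resulting logic.

(12.4) $(\mu RatM)$ $\Rightarrow$ (RatM) if $T$ is classically equivalent to a formula:

$\phi \vdash T'$ $\Rightarrow$ $M(\phi) \subseteq M(T')$. $Con(\phi, \overline{\overline{T'}})$ $\Leftrightarrow$ $M(\overline{\overline{T'}}) \cap M(\phi) \neq \emptyset$ $\Leftrightarrow$ $f(T') \cap M(\phi) \neq \emptyset$ by Fact 2.2.3 (page 29) (4). Thus
$f(M(\phi)) \subseteq f(M(T')) \cap M(\phi)$ by $(\mu RatM)$. Thus by (a) above $\overline{\overline{\overline{T'}} \cup \{\phi\}} \subseteq \overline{\overline{\phi}}$.

(13.1) (RatM =) $\Rightarrow$ $(\mu =)$

Let $X = M(T)$, $Y = M(T')$, and $X \subseteq Y$, $X \cap f(Y) \neq \emptyset$, so $T \vdash T'$ and $M(T) \cap f(M(T')) \neq \emptyset$, so $Con(T, \overline{\overline{T'}})$, so $\overline{\overline{\overline{T'}} \cup T} = \overline{\overline{T}}$
by (RatM =), so $f(X) = f(M(T)) = M(\overline{\overline{T}}) = M(\overline{\overline{T'}} \cup T) = M(\overline{\overline{T'}}) \cap M(T) = f(Y) \cap X$.

(13.2) $(\mu =) + (\mu dp)$ $\Rightarrow$ (RatM =)

Let $X = M(T)$, $Y = M(T')$, $T \vdash T'$, $Con(T, \overline{\overline{T'}})$, so $X \subseteq Y$ and by $(\mu dp)$ $X \cap f(Y) \neq \emptyset$, so by $(\mu =)$ $f(X) = f(Y) \cap X$.
So $\overline{\overline{\overline{T'}} \cup T} = \overline{\overline{T}}$ by (a) above and $(\mu dp)$.

(13.3) $(\mu =)$ $\not\Rightarrow$ (RatM =) without $(\mu dp)$:

$(\mu =)$ holds in all ranked preferential structures (see Definition 4.1.4 (page 59)) by Fact 4.2.7 (page 65). Example 2.3.6
(page 40) (1) shows that (RatM =) may fail in the resulting logic.

(13.4) $(\mu =)$ $\Rightarrow$ (RatM =) if $T$ is classically equivalent to a formula:

The proof is almost identical to the one for (12.4). Again, the prerequisites of $(\mu =)$ are satisfied, so $f(M(\phi)) = f(M(T')) \cap
M(\phi)$. Thus, $\overline{\overline{\overline{T'}} \cup \{\phi\}} = \overline{\overline{\phi}}$ by (a) above.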

Of the last four, we show (14), (15), (17), the proof for (16) is similar to the one for (17). 

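(14.1) (Log =') $\Rightarrow$ $(\mu =')$:

$f(M(T')) \cap M(T) \neq \emptyset$ $\Rightarrow$ $Con(\overline{\overline{T'}} \cup T)$ $\Rightarrow_{(Log =')}$ $\overline{\overline{T \cup T'}} = \overline{\overline{\overline{T'}} \cup T}$ $\Rightarrow$ $f(M(T \cup T')) = f(M(T')) \cap M(T)$.

(14.2) $(\mu =') + (\mu dp)$ $\Rightarrow$ (Log ='):

$Con(\overline{\overline{T'}} \cup T)$ $\Rightarrow_{(\mu dp)}$ $f(M(T')) \cap M(T) \neq \emptyset$ $\Rightarrow$ $f(M(T' \cup T)) = f(M(T') \cap M(T)) =_{(\mu =')} f(M(T')) \cap M(T)$, so $\overline{\overline{T' \cup T}} =
\overline{\overline{\overline{T'}} \cup T}$ by (a) above and $(\mu dp)$.

(14.3) $(\mu =')$ $\not\Rightarrow$ (Log =') without $(\mu dp)$:

By Fact 4.2.7 (page 65) $(\mu =')$ holds in ranked structures. Consider Example 2.3.6 (page 40) (2). There, $Con(T, \overline{\overline{T'}})$,
$T = T \cup T'$, and it was shown that $\overline{\overline{\overline{T'}} \cup T} \not\subseteq \overline{\overline{T}} = \overline{\overline{T \cup T'}}$.

(14.4) $(\mu =')$ $\Rightarrow$ (Log =') if $T$ is classically equivalent to a formula:

$Con(\overline{\overline{T'}} \cup \{\phi\})$ $\Rightarrow$ $\emptyset \neq M(\overline{\overline{T'}}) \cap M(\phi)$ $\Rightarrow$ $f(T') \cap M(\phi) \neq \emptyset$ by Fact 2.2.3 (page 29) (4). So $f(M(T' \cup \{\phi\})) = f(M(T') \cap M(\phi))
= f(M(T')) \cap M(\phi)$ by $(\mu =')$, so $\overline{\overline{T' \cup \{\phi\}}} = \overline{\overline{\overline{T'}} \cup \{\phi\}}$ by (a) above.

(15.1) (Log $\parallel$) $\Rightarrow$ $(\mu \parallel)$:
Trivial.

(15.2) $(\mu \parallel)$ $\Rightarrow$ (Log $\parallel$):
Trivial.

(16) (Log $\cup$) $\Leftrightarrow$ $(\mu \cup)$: Analogous to the proof of (17).

(17.1) (Log $\cup'$) $+ (\mu \subseteq) + (\mu =)$ $\Rightarrow$ $(\mu \cup')$:

$f(M(T')) \cap (M(T) - f(M(T))) \neq \emptyset$ $\Rightarrow$ (by $(\mu \subseteq)$, $(\mu =)$, Fact|4X5](page[6l ) $f(M(T')) \cap M(T) \neq \emptyset$, $f(M(T')) \cap f(M(T)) = \emptyset$ $\Rightarrow$
$Con(\overline{\overline{T'}}, T)$, $\neg Con(\overline{\overline{T'}}, \overline{\overline{T}})$ $\Rightarrow$ $\overline{\overline{T \vee T'}} = \overline{\overline{T}}$ $\Rightarrow$ $f(M(T)) = f(M(T \vee T')) = f(M(T) \cup M(T'))$.

(17.2) $(\mu \cup') + (\mu dp)$ $\Rightarrow$ (Log $\cup'$):

$Con(\overline{\overline{T'}} \cup T)$, $\neg Con(\overline{\overline{T'}} \cup \overline{\overline{T}})$ $\Rightarrow_{(\mu dp)}$ $f(T') \cap M(T) \neq \emptyset$, $f(T') \cap f(T) = \emptyset$ $\Rightarrow$ $f(M(T')) \cap (M(T) - f(M(T))) \neq \emptyset$ $\Rightarrow$ $f(M(T))
= f(M(T) \cup M(T')) = f(M(T \vee T'))$. So $\overline{\overline{T}} = \overline{\overline{T \vee T'}}$.

(17.3) and (16.3) are solved by Example 2.3.6 (page 40) (3).
□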

Example 2.3.6 

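(1) $(\mu =)$ without $(\mu dp)$ does not imply (RatM =):

Take $\{p_i : i \in \omega\}$ and put $m := m_{\bigwedge p_i}$, the model which makes all $p_i$ true, in the top layer, all the other in the bottom
layer. Let $m' \neq m$, $T' := \emptyset$, $T := Th(m, m')$. Then $\overline{\overline{T'}} = \overline{T'}$, so $Con(\overline{\overline{T'}}, T)$, $\overline{\overline{T}} = Th(m')$, $\overline{\overline{\overline{T'}} \cup T} = \overline{T}$.
So (RatM =) fails, but $(\mu =)$ holds in all ranked structures.

(2) $(\mu RatM)$ without $(\mu dp)$ does not imply (RatM):

Take $\{p_i : i \in \omega\}$ and let $m := m_{\bigwedge p_i}$, the model which makes all $p_i$ true.

Let $X := M(\neg p_0) \cup \{m\}$ be the top layer, put the rest of $M_{\mathcal{L}}$ in the bottom layer. Let $Y := M_{\mathcal{L}}$. The structure is ranked, so,
as shown in Fact 4.2.7 (page 65), $(\mu RatM)$ holds.

Let $T' := \emptyset$, $T := Th(X)$. We have to show that $Con(T, \overline{\overline{T'}})$, $T \vdash T'$, but $\overline{\overline{\overline{T'}} \cup T} \not\subseteq \overline{\overline{T}}$. $\overline{\overline{T'}} = Th(M(p_0) - \{m\}) = \overline{p_0}$. $T
= \overline{\{\neg p_0\} \vee Th(m)}$, $\overline{\overline{T}} = T$. So $Con(T, \overline{\overline{T'}})$. $M(\overline{\overline{T'}}) = M(p_0)$, $M(T) = X$, $M(\overline{\overline{T'}} \cup T) = M(\overline{\overline{T'}}) \cap M(T) = \{m\}$, $m \models p_1$, so
$p_1 \in \overline{\overline{\overline{T'}} \cup T}$, but $X \not\models p_1$.

(3) This example shows that we need $(\mu dp)$ to go from $(\mu \cup)$ to (Log $\cup$) and from $(\mu \cup')$ to (Log $\cup'$).
Let $v(\mathcal{L}) := \{p, q\} \cup \{p_i : i < \omega\}$. Let $m$ make all variables true.

Put all models of $\neg p$, and $m$, in the upper layer, all other models in the lower layer. This is ranked, so by Fact 4.2.7 (page
65) $(\mu \cup)$ and $(\mu \cup')$ hold. Set $X := M(\neg q) \cup \{m\}$, $X' := M(q) - \{m\}$, $T := Th(X) = \neg q \vee Th(m)$, $T' := Th(X') = q$.

Then $\overline{\overline{T}} = p \wedge \neg q$, $\overline{\overline{T'}} = p \wedge q$. We have $Con(\overline{\overline{T'}}, T)$, $\neg Con(\overline{\overline{T'}}, \overline{\overline{T}})$. But $\overline{\overline{T \vee T'}} = p \neq \overline{\overline{T}} = p \wedge \neg q$ and $Con(\overline{\overline{T \vee T'}}, T')$, so
(Log $\cup$) and (Log $\cup'$) fail.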

□ 



Fact 2.3.5 

(CUT) $\not\Rightarrow$ (PR)

Proof 

We give two proofs: 

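(1) If (CUT) $\Rightarrow$ (PR), then by $(\mu PR)$ $\Rightarrow$ (by Fact EH (page [32]) (3)) $(\mu CUT)$ $\Rightarrow$ (by Proposition 2.3.4 (page 36) (7.2))
(CUT) $\Rightarrow$ (PR) we would have a proof of $(\mu PR)$ $\Rightarrow$ (PR) without $(\mu dp)$, which is impossible, as shown by Example 4.2.1
(page 62).

(2) Reconsider Example 2.3.2 (page 35), and say $a \models p \wedge q$, $b \models p \wedge \neg q$, $c \models \neg p \wedge q$. It is shown there that $(\mu CUM)$ holds, so
$(\mu CUT)$ holds, so by Proposition 2.3.4 (page 36) (7.2) (CUT) holds, if we define $\overline{\overline{T}} := Th(f(M(T)))$. Set $T := \{p \vee (\neg p \wedge q)\}$,
$T' :=$ then $\overline{\overline{T \cup T'}} = \overline{\overline{T'}} = \overline{\{p \wedge \neg q\}}$, $\overline{\overline{T}} = \overline{T}$, $\overline{\overline{\overline{T}} \cup T'} = \overline{T'} = \overline{T \cup T'}$, so (PR) fails.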
Chapter 3

Abstract semantics by size 



3.1 The first order setting 

We first introduce a generalized quantifier in a first order setting, as this is very natural, and prepares the more abstract 
discussion to come. 

Definition 3.1.1 

Augment the language of first order logic by the new quantifier: If $\phi$ and $\psi$ are formulas, then so are $\nabla x \phi(x)$, $\nabla x \phi(x) : \psi(x)$,
for any variable $x$. The :-versions are the restricted variants. We call any formula of $\mathcal{L}$, possibly containing $\nabla$, a
$\nabla$-$\mathcal{L}$-formula.

Definition 3.1.2 

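($\mathcal{N}$-Model)

Let $\mathcal{L}$ be a first order language, and $M$ be a $\mathcal{L}$-structure. Let $\mathcal{N}(M)$ be a weak filter, or $\mathcal{N}$-system - $\mathcal{N}$ for normal - over
$M$. Define $\langle M, \mathcal{N}(M) \rangle \models \phi$ for any $\nabla$-$\mathcal{L}$-formula inductively as usual, with one additional induction step:

$\langle M, \mathcal{N}(M) \rangle \models \nabla x \phi(x)$ iff there is $A \in \mathcal{N}(M)$ s.t. $\forall a \in A$ $(\langle M, \mathcal{N}(M) \rangle \models \phi[a])$.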
Definition 3.1.3 

Let any axiomatization of predicate calculus be given. Augment this with the axiom schemata 

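(1) $\nabla x \phi(x) \wedge \forall x(\phi(x) \rightarrow \psi(x)) \rightarrow \nabla x \psi(x)$,

(2) $\nabla x \phi(x) \rightarrow \neg \nabla x \neg \phi(x)$,

(3) $\forall x \phi(x) \rightarrow \nabla x \phi(x)$ and $\nabla x \phi(x) \rightarrow \exists x \phi(x)$,

(4) $\nabla x \phi(x) \leftrightarrow \nabla y \phi(y)$ if $x$ does not occur free in $\phi(y)$ and $y$ does not occur free in $\phi(x)$.
(for all $\phi$, $\psi$).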

Proposition 3.1.1 

The axioms given in Definition 3.1.3 (page 43) are sound and complete for the semantics of Definition 3.1.2 (page 43).
See [Sch95-1] or [Sch04].

Definition 3.1.4 

Call $\mathcal{N}^+(M) = \langle \mathcal{N}(N) : N \subseteq M \rangle$ a $\mathcal{N}^+$-system or system of weak filters over $M$ iff for each $N \subseteq M$ $\mathcal{N}(N)$ is a weak
filter or $\mathcal{N}$-system over $N$. (It suffices to consider the definable subsets of $M$.)

Definition 3.1.5 

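Let $\mathcal{L}$ be a first order language, and $M$ a $\mathcal{L}$-structure. Let $\mathcal{N}^+(M)$ be a $\mathcal{N}^+$-system over $M$.
Define $\langle M, \mathcal{N}^+(M) \rangle \models \phi$ for any formula inductively as usual, with the additional induction steps:

1. $\langle M, \mathcal{N}^+(M) \rangle \models \nabla x \phi(x)$ iff there is $A \in \mathcal{N}(M)$ s.t. $\forall a \in A$ $(\langle M, \mathcal{N}^+(M) \rangle \models \phi[a])$,

2. $\langle M, \mathcal{N}^+(M) \rangle \models \nabla x \phi(x) : \psi(x)$ iff there is $A \in \mathcal{N}(\{x : \langle M, \mathcal{N}^+(M) \rangle \models \phi(x)\})$ s.t. $\forall a \in A$ $(\langle M, \mathcal{N}^+(M) \rangle \models \psi[a])$.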
Definition 3.1.6 

Extend the logic of first order predicate calculus by adding the axiom schemata 

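(1) a. $\nabla x \phi(x) \leftrightarrow \nabla x(x = x) : \phi(x)$, b. $\forall x(\sigma(x) \leftrightarrow \tau(x)) \wedge \nabla x \sigma(x) : \phi(x) \rightarrow \nabla x \tau(x) : \phi(x)$,

(2) $\nabla x \phi(x) : \psi(x) \wedge \forall x(\phi(x) \wedge \psi(x) \rightarrow \vartheta(x)) \rightarrow \nabla x \phi(x) : \vartheta(x)$,

(3) $\exists x \phi(x) \wedge \nabla x \phi(x) : \psi(x) \rightarrow \neg \nabla x \phi(x) : \neg \psi(x)$,

(4) $\forall x(\phi(x) \rightarrow \psi(x)) \rightarrow \nabla x \phi(x) : \psi(x)$ and $\nabla x \phi(x) : \psi(x) \rightarrow [\exists x \phi(x) \rightarrow \exists x(\phi(x) \wedge \psi(x))]$,

(5) $\nabla x \phi(x) : \psi(x) \leftrightarrow \nabla y \phi(y) : \psi(y)$ (under the usual caveat for substitution).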
Proposition 3.1.2

The axioms of Definition 3.1.6 (page 43) are sound and complete for the $\mathcal{N}^+$-semantics of $\nabla$ as defined in Definition
3.1.5 (page 43).

See [Sch95-1] or [Sch04].



3.2 General size semantics 

3.2.1 Introduction 
3.2.1.1 Context 



We show how one can develop a multitude of rules for nonmonotonic logics from a very small set of principles about 
reasoning with size. The notion of size gives an algebraic semantics to nonmonotonic logics, in the sense that $\alpha$ implies
$\beta$ iff the set of cases where $\alpha \wedge \neg\beta$ holds is a small subset of all $\alpha$-cases. In a similar way, e.g. Heyting algebras are an
algebraic semantics for intuitionistic logic. 

In our understanding, algebraic semantics describe the abstract properties corresponding model sets have. Structural 
semantics, on the other hand, give intuitive concepts like accessibility or preference, from which properties of model sets, 
and thus algebraic semantics, originate. 

Varying properties of structural semantics (e.g. transitivity, etc.) result in varying properties of algebraic semantics, and 
thus of logical rules. We consider operations directly on the algebraic semantics and their logical consequences, and we 
see that simple manipulations of the size concept result in most rules of nonmonotonic logics. Even more, we show how 
to generate new rules from those manipulations. The result is one big table, which, in a much more modest scale, can be 
seen as a "periodic table" of the "elements" of nonmonotonic logic. Some simple underlying principles allow us to generate
them all.

Historical remarks: The first time that abstract size was related to nonmonotonic logics was, to our knowledge, in the
second author's [Sch90] and [Sch95-1], and, independently, in [BB94]. More detailed remarks can e.g. be found in [GS08c].
But, again to our knowledge, connections are elaborated systematically and in fine detail here for the first time.

3.2.1.2 Overview 

The main part of this Section is the big table in Section 3.2.2.6 (page 46). It shows connections and how to develop a
multitude of logical rules known from nonmonotonic logics by combining a small number of principles about size. We use
them as building blocks to construct the rules from.

These principles are some basic and very natural postulates, (Opt), (iM), $(eM\mathcal{I})$, $(eM\mathcal{F})$, and a continuum of power of
the notion of "small", or, dually, "big", from $(1 * s)$ to $(< \omega * s)$. From these, we can develop the rest except, essentially,
Rational Monotony, and thus an infinity of different rules.

This is a conceptual Section, and it does not contain any more difficult formal results. The interest lies, in our opinion, 
in the simplicity, paucity, and naturalness of the basic building blocks. We hope that this schema brings more and deeper 
order into the rich fauna of nonmonotonic and related logics. 



3.2.2 Main table 
3.2.2.1 Notation 

(1) $\mathcal{I}(X) \subseteq \mathcal{P}(X)$ and $\mathcal{F}(X) \subseteq \mathcal{P}(X)$ are dual abstract notions of size, $\mathcal{I}(X)$ is the set of "small" subsets of $X$, $\mathcal{F}(X)$
the set of "big" subsets of $X$. They are dual in the sense that $A \in \mathcal{I}(X) \Leftrightarrow X - A \in \mathcal{F}(X)$. "$\mathcal{I}$" evokes "ideal",
"$\mathcal{F}$" evokes "filter" though the full strength of both is reached only in $(< \omega * s)$. "s" evokes "small", and "$(x * s)$"
stands for "$x$ small sets together are still not everything".

(2) If $A \subseteq X$ is neither in $\mathcal{I}(X)$, nor in $\mathcal{F}(X)$, we say it has medium size, and we define $\mathcal{M}(X) := \mathcal{P}(X) - (\mathcal{I}(X) \cup \mathcal{F}(X))$.
$\mathcal{M}^+(X) := \mathcal{P}(X) - \mathcal{I}(X)$ is the set of subsets which are not small.

(3) $\nabla x \phi$ is a generalized first order quantifier, it is read "almost all $x$ have property $\phi$". $\nabla x(\phi : \psi)$ is the relativized
version, read: "almost all $x$ with property $\phi$ have also property $\psi$". To keep the table simple, we write mostly
only the non-relativized versions. Formally, we have $\nabla x \phi :\Leftrightarrow \{x : \phi(x)\} \in \mathcal{F}(U)$ where $U$ is the universe, and
$\nabla x(\phi : \psi) :\Leftrightarrow \{x : (\phi \wedge \psi)(x)\} \in \mathcal{F}(\{x : \phi(x)\})$. Soundness and completeness results on $\nabla$ can be found in [Sch95-1].

(4) Analogously, for propositional logic, we define:
$\alpha |\!\sim \beta :\Leftrightarrow M(\alpha \wedge \beta) \in \mathcal{F}(M(\alpha))$,

where $M(\phi)$ is the set of models of $\phi$.

(5) In preferential structures, $\mu(X) \subseteq X$ is the set of minimal elements of $X$. This generates a principal filter by
$\mathcal{F}(X) := \{A \subseteq X : \mu(X) \subseteq A\}$. Corresponding properties about $\mu$ are not listed systematically.
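
Items (1), (4) and (5) above can be made concrete on a finite set of models. The following is a minimal sketch, not taken from the original text: the preference relation is passed as a set of pairs `(y, x)` meaning "y is strictly preferred to x", models are arbitrary hashable labels, and all function names (`powerset`, `minimal`, `big`, `small`, `entails`) are hypothetical.

```python
from itertools import combinations

def powerset(xs):
    """All subsets of a finite set, as frozensets."""
    items = sorted(xs)
    return [frozenset(c) for r in range(len(items) + 1)
            for c in combinations(items, r)]

def minimal(X, prec):
    """mu(X): elements of X with no strictly preferred element in X."""
    return frozenset(x for x in X if not any((y, x) in prec for y in X))

def big(X, prec):
    """Principal filter F(X) := {A ⊆ X : mu(X) ⊆ A}."""
    m = minimal(X, prec)
    return {A for A in powerset(X) if m <= A}

def small(X, prec):
    """Dual ideal I(X): A is small iff X - A is big."""
    F = big(X, prec)
    return {A for A in powerset(X) if frozenset(X) - A in F}

def entails(models_alpha, models_beta, prec):
    """alpha |~ beta iff M(alpha ∧ beta) ∈ F(M(alpha))."""
    X = frozenset(models_alpha)
    return X & frozenset(models_beta) in big(X, prec)

# Toy assumption: two models of "bird", the flying one is preferred.
prec = {("bird_flies", "bird_grounded")}
birds = {"bird_flies", "bird_grounded"}
flies = {"bird_flies"}
birds_normally_fly = entails(birds, flies, prec)
```

In this toy structure the only minimal bird model is the flying one, so the set of flying birds is big in the set of birds and `birds_normally_fly` evaluates to `True`, while the grounded birds form a small set.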



The usual rules (AND) etc. are named here $(AND_\omega)$, etc., as they are in a natural ascending line of similar rules, based on strengthening of the filter/ideal properties.




3.2.2.2 The groups of rules

The rules are divided into 5 groups: 

(1) (Opt), which says that "All" is optimal - i.e. when there are no exceptions, then a soft rule |~ holds. 

(2) 3 monotony rules: 

(2.1) (iM) is inner monotony, a subset of a small set is small, 

(2.2) (eMI) external monotony for ideals: enlarging the base set keeps small sets small, 

(2.3) $(eM\mathcal{F})$ external monotony for filters: a big subset stays big when the base set shrinks.

These three rules are very natural if "size" is anything coherent over change of base sets. In particular, they can be 
seen as weakening. 

(3) $(\approx)$ keeps proportions; it is here mainly to point out the possibility.

(4) a group of rules x * s, which say how many small sets will not yet add to the base set. 

(5) Rational monotony, which can best be understood as robustness of $\mathcal{M}^+$, see $(\mathcal{M}^{++})$(3).

3.2.2.2.1 Regularities 

(1) The group of rules $(x * s)$ use ascending strength of $\mathcal{I}/\mathcal{F}$.

(2) The column $(\mathcal{M}^+)$ contains interesting algebraic properties. In particular, they show a strengthening from $(3 * s)$ up
to Rationality. They are not necessarily equivalent to the corresponding $(\mathcal{I}_x)$ rules, not even in the presence of the
basic rules. The examples show that care has to be taken when considering the different variants.

(3) Adding the somewhat superfluous $(CM_2)$, we have increasing cautious monotony from (wCM) to full $(CM_\omega)$.

(4) We have increasing "or" from (wOR) to full $(OR_\omega)$.

(5) The line $(2 * s)$ is only there because there seems to be no $(\mathcal{M}^+_2)$, otherwise we could begin $(n * s)$ at $n = 2$.

3.2.2.3 Direct correspondences 

Several correspondences are trivial and are mentioned now. Somewhat less obvious (in)dependencies are given in Section
3.2.3 (page 48). Finally, the connections with the $\mu$-rules are given in Section 3.2.4 (page 52). In those rules, $(\mathcal{I}_\omega)$ is
implicit, as they are about principal filters. Still, the $\mu$-rules are written in the main table in their intuitively adequate
place.

(1) The columns "Ideal" and "Filter" are mutually dual, when both entries are defined. 

(2) The correspondence between the ideal/filter column and the $\nabla$-column is obvious, the latter is added only for
completeness' sake, and to point out the trivial translation to first order logic.

(3) The ideal/filter and the AND-column correspond directly. 

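(4) We can construct logical rules from the $\mathcal{M}^+$-column by direct correspondence, e.g. for $(\mathcal{M}^+_\omega)$, (1):
Set $Y := M(\gamma)$, $X := M(\gamma \wedge \beta)$, $A := M(\gamma \wedge \beta \wedge \alpha)$.

• $X \in \mathcal{M}^+(Y)$ will become $\gamma \mathrel{|\!\not\sim} \neg\beta$

• $A \in \mathcal{F}(X)$ will become $\gamma \wedge \beta |\!\sim \alpha$

• $A \in \mathcal{M}^+(Y)$ will become $\gamma \mathrel{|\!\not\sim} \neg(\alpha \wedge \beta)$.

so we obtain $\gamma \mathrel{|\!\not\sim} \neg\beta$, $\gamma \wedge \beta |\!\sim \alpha$ $\Rightarrow$ $\gamma \mathrel{|\!\not\sim} \neg(\alpha \wedge \beta)$.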

We did not want to make the table too complicated, so such rules are not listed in the table. 

(5) Various direct correspondences: 

• In the line (Opt), the filter/ideal entry corresponds to (SC),

• in the line (iM), the filter/ideal entry corresponds to (RW),

• in the line $(eM\mathcal{I})$, the ideal entry corresponds to (PR') and (wOR),

• in the line $(eM\mathcal{F})$, the filter entry corresponds to (wCM),

• in the line $(\approx)$, the filter/ideal entry corresponds to (disjOR),

• in the line $(1 * s)$, the filter/ideal entry corresponds to (CP),

• in the line $(2 * s)$, the filter/ideal entry corresponds to $(CM_2) = (OR_2)$.

(6) Note that one can, e.g., write $(AND_2)$ in two flavours:

• $\alpha |\!\sim \beta$, $\alpha |\!\sim \beta'$ $\Rightarrow$ $\alpha \not\vdash \neg\beta \vee \neg\beta'$, or

• $\alpha |\!\sim \beta$ $\Rightarrow$ $\alpha \mathrel{|\!\not\sim} \neg\beta$

(which is $(CM_2) = (OR_2)$.)
3.2.2.4 Rational Monotony

(RatM) does not fit into adding small sets. We have exhausted the combination of small sets by $(< \omega * s)$, unless we go
to languages with infinitary formulas.

The next idea would be to add medium size sets. But, by definition, 2 * medium can be all. Adding small and medium
sets would not help either: Suppose we have a rule medium + n * small $\neq$ all. Taking the complement of the first medium
set, which is again medium, we have the rule 2 * n * small $\neq$ all. So we do not see any meaningful new internal rule, i.e.
without changing the base set.

Probably, (RatM) has more to do with independence: by default, all "normalities" are independent, and intersecting with 
another formula preserves normality. 



3.2.2.5 Summary 

We can obtain all rules except (RatM) and $(\approx)$ from (Opt), the monotony rules - (iM), $(eM\mathcal{I})$, $(eM\mathcal{F})$ -, and $(x * s)$ with
increasing $x$.



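3.2.2.6 Main table

Each row lists, where defined, the entries of the columns "Ideal", "Filter", $\mathcal{M}^+$, $\nabla$, and the various rules (AND, OR, Caut./Rat. Mon.).

Optimal proportion

(Opt)
  Ideal: $\emptyset \in \mathcal{I}(X)$
  Filter: $X \in \mathcal{F}(X)$
  Rules: (SC) $\alpha \vdash \beta \Rightarrow \alpha |\!\sim \beta$

Monotony (Improving proportions). (iM): internal monotony, $(eM\mathcal{I})$: external monotony for ideals, $(eM\mathcal{F})$: external monotony for filters

(iM)
  Ideal: $A \subseteq B \in \mathcal{I}(X) \Rightarrow A \in \mathcal{I}(X)$
  Filter: $A \in \mathcal{F}(X)$, $A \subseteq B \subseteq X \Rightarrow B \in \mathcal{F}(X)$
  $\nabla$: $\nabla x \phi \wedge \forall x(\phi \rightarrow \phi') \rightarrow \nabla x \phi'$
  Rules: (RW) $\alpha |\!\sim \beta$, $\beta \vdash \beta' \Rightarrow \alpha |\!\sim \beta'$

$(eM\mathcal{I})$
  Ideal: $X \subseteq Y \Rightarrow \mathcal{I}(X) \subseteq \mathcal{I}(Y)$
  $\nabla$: $\nabla x(\phi : \psi) \wedge \forall x(\phi' \rightarrow \psi) \rightarrow \nabla x(\phi \vee \phi' : \psi)$
  Rules: (PR') $\alpha |\!\sim \beta$, $\alpha \vdash \alpha'$, $\alpha' \wedge \neg\alpha \vdash \beta \Rightarrow \alpha' |\!\sim \beta$
         $(\mu PR)$ $X \subseteq Y \Rightarrow \mu(Y) \cap X \subseteq \mu(X)$
  OR: (wOR) $\alpha |\!\sim \beta$, $\alpha' \vdash \beta \Rightarrow \alpha \vee \alpha' |\!\sim \beta$
      $(\mu wOR)$ $\mu(X \cup Y) \subseteq \mu(X) \cup Y$

$(eM\mathcal{F})$
  Filter: $X \subseteq Y \Rightarrow \mathcal{F}(Y) \cap \mathcal{P}(X) \subseteq \mathcal{F}(X)$
  $\nabla$: $\nabla x(\phi : \psi) \wedge \forall x(\phi \wedge \psi \rightarrow \phi') \rightarrow \nabla x(\phi \wedge \phi' : \psi)$
  Caut. Mon.: (wCM) $\alpha |\!\sim \beta$, $\alpha' \vdash \alpha$, $\alpha \wedge \beta \vdash \alpha' \Rightarrow \alpha' |\!\sim \beta$

Keeping proportions

$(\approx)$
  Ideal: $(\mathcal{I} \cup disj)$ $A \in \mathcal{I}(X)$, $B \in \mathcal{I}(Y)$, $X \cap Y = \emptyset \Rightarrow A \cup B \in \mathcal{I}(X \cup Y)$
  Filter: $(\mathcal{F} \cup disj)$ $A \in \mathcal{F}(X)$, $B \in \mathcal{F}(Y)$, $X \cap Y = \emptyset \Rightarrow A \cup B \in \mathcal{F}(X \cup Y)$
  $\nabla$: $\nabla x(\phi : \psi) \wedge \nabla x(\phi' : \psi) \wedge \neg\exists x(\phi \wedge \phi') \rightarrow \nabla x(\phi \vee \phi' : \psi)$
  OR: (disjOR) $\phi |\!\sim \psi$, $\phi' |\!\sim \psi'$, $\phi \vdash \neg\phi' \Rightarrow \phi \vee \phi' |\!\sim \psi \vee \psi'$
      $(\mu disjOR)$ $X \cap Y = \emptyset \Rightarrow \mu(X \cup Y) \subseteq \mu(X) \cup \mu(Y)$

Robustness of proportions: n * small $\neq$ All

$(1 * s)$
  Ideal: $(\mathcal{I}_1)$ $X \notin \mathcal{I}(X)$
  Filter: $\emptyset \notin \mathcal{F}(X)$
  $\nabla$: $(\nabla_1)$ $\nabla x \phi \rightarrow \exists x \phi$
  Rules: (CP)
  AND: $(AND_1)$

$(2 * s)$
  Ideal: $(\mathcal{I}_2)$ $A, B \in \mathcal{I}(X) \Rightarrow A \cup B \neq X$
  Filter: $A, B \in \mathcal{F}(X) \Rightarrow A \cap B \neq \emptyset$
  $\nabla$: $(\nabla_2)$ $\nabla x \phi \wedge \nabla x \psi \rightarrow \exists x(\phi \wedge \psi)$
  AND: $(AND_2)$ $\alpha |\!\sim \beta$, $\alpha |\!\sim \beta' \Rightarrow \alpha \not\vdash \neg\beta \vee \neg\beta'$
  Caut. Mon.: $(CM_2)$ $\alpha |\!\sim \beta \Rightarrow \alpha \mathrel{|\!\not\sim} \neg\beta$

$(n * s)$ $(n \geq 3)$
  Ideal: $(\mathcal{I}_n)$ $A_1, \ldots, A_n \in \mathcal{I}(X) \Rightarrow A_1 \cup \ldots \cup A_n \neq X$
  Filter: $A_1, \ldots, A_n \in \mathcal{F}(X) \Rightarrow A_1 \cap \ldots \cap A_n \neq \emptyset$
  $\mathcal{M}^+$: $(\mathcal{M}^+_n)$ $X_1 \in \mathcal{F}(X_2), \ldots, X_{n-1} \in \mathcal{F}(X_n) \Rightarrow X_1 \in \mathcal{M}^+(X_n)$
  $\nabla$: $(\nabla_n)$ $\nabla x \phi_1 \wedge \ldots \wedge \nabla x \phi_n \rightarrow \exists x(\phi_1 \wedge \ldots \wedge \phi_n)$
  AND: $(AND_n)$ $\alpha |\!\sim \beta_1, \ldots, \alpha |\!\sim \beta_n \Rightarrow \alpha \not\vdash \neg\beta_1 \vee \ldots \vee \neg\beta_n$
  OR: $(OR_n)$ $\alpha_1 |\!\sim \beta, \ldots, \alpha_{n-1} |\!\sim \beta \Rightarrow \alpha_1 \vee \ldots \vee \alpha_{n-1} \mathrel{|\!\not\sim} \neg\beta$
  Caut. Mon.: $(CM_n)$ $\alpha |\!\sim \beta_1, \ldots, \alpha |\!\sim \beta_{n-1} \Rightarrow \alpha \wedge \beta_1 \wedge \ldots \wedge \beta_{n-2} \mathrel{|\!\not\sim} \neg\beta_{n-1}$

$(< \omega * s)$
  Ideal: $A, B \in \mathcal{I}(X) \Rightarrow A \cup B \in \mathcal{I}(X)$
  Filter: $A, B \in \mathcal{F}(X) \Rightarrow A \cap B \in \mathcal{F}(X)$
  $\mathcal{M}^+$: $(\mathcal{M}^+_\omega)$
      (1) $A \in \mathcal{F}(X)$, $X \in \mathcal{M}^+(Y) \Rightarrow A \in \mathcal{M}^+(Y)$
      (2) $A \in \mathcal{M}^+(X)$, $X \in \mathcal{F}(Y) \Rightarrow A \in \mathcal{M}^+(Y)$
      (3) $A \in \mathcal{F}(X)$, $X \in \mathcal{F}(Y) \Rightarrow A \in \mathcal{F}(Y)$
      (4) $A, B \in \mathcal{I}(X) \Rightarrow A - B \in \mathcal{I}(X - B)$
  $\nabla$: $(\nabla_\omega)$ $\nabla x \phi \wedge \nabla x \psi \rightarrow \nabla x(\phi \wedge \psi)$
  AND: $(AND_\omega)$ $\alpha |\!\sim \beta$, $\alpha |\!\sim \beta' \Rightarrow \alpha |\!\sim \beta \wedge \beta'$
  OR: $(OR_\omega)$ $\alpha |\!\sim \beta$, $\alpha' |\!\sim \beta \Rightarrow \alpha \vee \alpha' |\!\sim \beta$
      $(\mu OR)$ $\mu(X \cup Y) \subseteq \mu(X) \cup \mu(Y)$
  Caut. Mon.: $(CM_\omega)$ $\alpha |\!\sim \beta$, $\alpha |\!\sim \beta' \Rightarrow \alpha \wedge \beta |\!\sim \beta'$
      $(\mu CM)$ $\mu(X) \subseteq Y \subseteq X \Rightarrow \mu(Y) \subseteq \mu(X)$

Robustness of $\mathcal{M}^+$

$(\mathcal{M}^{++})$
  $\mathcal{M}^+$:
      (1) $A \in \mathcal{I}(X)$, $B \notin \mathcal{F}(X) \Rightarrow A - B \in \mathcal{I}(X - B)$
      (2) $A \in \mathcal{F}(X)$, $B \notin \mathcal{F}(X) \Rightarrow A - B \in \mathcal{F}(X - B)$
      (3) $A \in \mathcal{M}^+(X)$, $X \in \mathcal{M}^+(Y) \Rightarrow A \in \mathcal{M}^+(Y)$
  Rat. Mon.: (RatM) $\phi |\!\sim \psi$, $\phi \mathrel{|\!\not\sim} \neg\psi' \Rightarrow \phi \wedge \psi' |\!\sim \psi$
      $(\mu RatM)$ $X \subseteq Y$, $X \cap \mu(Y) \neq \emptyset \Rightarrow \mu(X) \subseteq \mu(Y) \cap X$
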
Remark 3.2.1

There is, however, an important conceptual distinction to make here. Filters express "size" in an abstract way, in the
context of nonmonotonic logics, $\alpha |\!\sim \beta$ iff the set of $\alpha \wedge \neg\beta$ is small in $\alpha$. But here, we were interested in "small" changes
in the reference set $A$ (or $\alpha$ in our example). So we have two quite different uses of "size", one for nonmonotonic logics,
abstractly expressed by a filter, the other for coherence conditions. It is possible, but not necessary, to consider both
essentially the same notions. But we should not forget that we have two conceptually different uses of size here.

3.2.3 Coherent systems 
3.2.3.1 Definition and basic facts 

Note that whenever we work with model sets, the rule 
(LLE), left logical equivalence, $\vdash \alpha \leftrightarrow \alpha' \Rightarrow (\alpha |\!\sim \beta \Leftrightarrow \alpha' |\!\sim \beta)$
will hold. We will not mention this any further. 
Definition 3.2.1 

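A coherent system of sizes, $\mathcal{CS}$, consists of a universe $U$, $\mathcal{Y} \subseteq \mathcal{P}(U)$, and for all $X \in \mathcal{Y}$ a system $\mathcal{I}(X) \subseteq \mathcal{P}(X)$ (dually
$\mathcal{F}(X)$, i.e. $A \in \mathcal{F}(X) \Leftrightarrow X - A \in \mathcal{I}(X)$). $\mathcal{Y}$ may satisfy certain closure properties like closure under $\cup$, $\cap$, complementation,
etc. We will mention this when needed, and not obvious.

We say that $\mathcal{CS}$ satisfies a certain property iff all $X, Y \in \mathcal{Y}$ satisfy this property.
$\mathcal{CS}$ is called basic or level 1 iff it satisfies (Opt), (iM), $(eM\mathcal{I})$, $(eM\mathcal{F})$, $(1 * s)$.
$\mathcal{CS}$ is level $x$ iff it satisfies (Opt), (iM), $(eM\mathcal{I})$, $(eM\mathcal{F})$, $(x * s)$.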

Fact 3.2.2 

Note that, if for any $Y$ $\mathcal{I}(Y)$ consists only of subsets of at most 1 element, then $(eM\mathcal{F})$ is trivially satisfied for $Y$ and its
subsets by (Opt). □



Fact 3.2.3 

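Let a $\mathcal{CS}$ be given s.t. $\mathcal{Y} = \mathcal{P}(U)$. If $X \in \mathcal{Y}$ satisfies $(\mathcal{M}^{++})$, but not $(< \omega * s)$, then there is $Y \in \mathcal{Y}$ which does not satisfy
$(2 * s)$.

Proof

We work with version (1) of $(\mathcal{M}^{++})$, we will see in Fact 3.2.10 (page 51) that all three versions are equivalent.

As $X$ does not satisfy $(< \omega * s)$, there are $A, B \in \mathcal{I}(X)$ s.t. $A \cup B \in \mathcal{M}^+(X)$. $A \in \mathcal{I}(X)$, $A \cup B \in \mathcal{M}^+(X)$ $\Rightarrow$
$X - (A \cup B) \notin \mathcal{F}(X)$, so by $(\mathcal{M}^{++})$(1) $A = A - (X - (A \cup B)) \in \mathcal{I}(X - (X - (A \cup B))) = \mathcal{I}(A \cup B)$. Likewise $B \in \mathcal{I}(A \cup B)$,
so $(2 * s)$ does not hold for $A \cup B$. □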



Fact 3.2.4 

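$(eM\mathcal{I})$ and $(eM\mathcal{F})$ are formally independent, though intuitively equivalent.
Proof

Let $U := \{x, y, z\}$, $X := \{x, z\}$, $\mathcal{Y} := \mathcal{P}(U) - \{\emptyset\}$

(1) Let $\mathcal{F}(U) := \{A \subseteq U : z \in A\}$, $\mathcal{F}(Y) := \{Y\}$ for all $Y \subset U$. (Opt), (iM) hold, $(eM\mathcal{I})$ holds trivially, so does $(< \omega * s)$,
but $(eM\mathcal{F})$ fails for $U$ and $X$.

(2) Let $\mathcal{F}(X) := \{\{z\}, X\}$, $\mathcal{F}(Y) := \{Y\}$ for all $Y \subseteq U$, $Y \neq X$. (Opt), (iM), $(< \omega * s)$ hold trivially, $(eM\mathcal{F})$ holds by
Fact 3.2.2 (page 48). $(eM\mathcal{I})$ fails, as $\{x\} \in \mathcal{I}(X)$, but $\{x\} \notin \mathcal{I}(U)$.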

□ 
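
The two counterexamples in the proof of Fact 3.2.4 are small enough to check mechanically. The sketch below is an illustration under stated assumptions, not the authors' construction verbatim: a size system is represented as a dictionary from each nonempty subset of U to its set of big subsets, the ideal is derived by duality, and the names `ideal_from_filter`, `holds_eMI`, `holds_eMF`, `F1`, `F2` are hypothetical.

```python
from itertools import combinations

def nonempty_subsets(universe):
    """All nonempty subsets of a finite set, as frozensets."""
    items = sorted(universe)
    return [frozenset(c) for r in range(1, len(items) + 1)
            for c in combinations(items, r)]

U = frozenset("xyz")
X = frozenset("xz")
DOMAIN = nonempty_subsets(U)

def ideal_from_filter(F):
    """Duality: A is small in Y iff Y - A is big in Y."""
    return {Y: {Y - A for A in F[Y]} for Y in F}

def holds_eMI(F):
    """(eMI): X ⊆ Y implies I(X) ⊆ I(Y)."""
    I = ideal_from_filter(F)
    return all(I[A] <= I[B] for A in I for B in I if A <= B)

def holds_eMF(F):
    """(eMF): X ⊆ Y implies F(Y) ∩ P(X) ⊆ F(X)."""
    return all({S for S in F[B] if S <= A} <= F[A]
               for A in F for B in F if A <= B)

# Case (1): F(U) = all subsets containing z, F(Y) = {Y} otherwise.
F1 = {Y: {Y} for Y in DOMAIN}
F1[U] = {A for A in DOMAIN if "z" in A}

# Case (2): F(X) = {{z}, X}, F(Y) = {Y} otherwise.
F2 = {Y: {Y} for Y in DOMAIN}
F2[X] = {frozenset("z"), X}
```

Under this encoding, `F1` satisfies `holds_eMI` but not `holds_eMF`, and `F2` satisfies `holds_eMF` but not `holds_eMI`, matching cases (1) and (2) of the proof.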



Fact 3.2.5 

A level n system is strictly weaker than a level n + 1 system. 
Proof 

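Consider $U := \{1, \ldots, n + 1\}$, $\mathcal{Y} := \mathcal{P}(U) - \{\emptyset\}$. Let $\mathcal{I}(U) := \{\emptyset\} \cup \{\{x\} : x \in U\}$, $\mathcal{I}(X) := \{\emptyset\}$ for $X \neq U$. (iM), $(eM\mathcal{I})$,
$(eM\mathcal{F})$ hold trivially, $(n * s)$ holds trivially for $X \neq U$, but also for $U$. $((n + 1) * s)$ does not hold for $U$. □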
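
The separating system used in this proof can be built and tested for small n. The following sketch is illustrative only: it encodes the ideal as a dictionary from nonempty subsets of U to their sets of small subsets and reads $(k * s)$ as "no k small subsets of X, repetitions allowed, have union X"; the function names are hypothetical.

```python
from itertools import combinations, combinations_with_replacement

def nonempty_subsets(universe):
    """All nonempty subsets of a finite set, as frozensets."""
    items = sorted(universe)
    return [frozenset(c) for r in range(1, len(items) + 1)
            for c in combinations(items, r)]

def separating_system(n):
    """U = {1..n+1}; singletons are small in U, only ∅ is small elsewhere."""
    U = frozenset(range(1, n + 2))
    ideal = {X: {frozenset()} for X in nonempty_subsets(U)}
    ideal[U] = {frozenset()} | {frozenset({x}) for x in U}
    return ideal

def k_small_sets_never_cover(ideal, k):
    """(k * s): the union of any k small subsets of X differs from X."""
    return all(frozenset().union(*choice) != X
               for X, small in ideal.items()
               for choice in combinations_with_replacement(list(small), k))

def separates_levels(n):
    """True iff the system satisfies (n * s) but not ((n + 1) * s)."""
    ideal = separating_system(n)
    return (k_small_sets_never_cover(ideal, n)
            and not k_small_sets_never_cover(ideal, n + 1))
```

For example, `separates_levels(3)` builds the system on {1, 2, 3, 4}: any three singletons miss at least one element, while the four singletons together cover U.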
Remark 3.2.6

Note that our schemata allow us to generate infinitely many new rules, here is an example:

Start with $A$, add $s_{1,1}$, $s_{1,2}$ two sets small in $A \cup s_{1,1}$ ($A \cup s_{1,2}$ respectively). Consider now $A \cup s_{1,1} \cup s_{1,2}$ and $s_2$ s.t. $s_2$ is
small in $A \cup s_{1,1} \cup s_{1,2} \cup s_2$. Continue with $s_{3,1}$, $s_{3,2}$ small in $A \cup s_{1,1} \cup s_{1,2} \cup s_2 \cup s_{3,1}$ etc.

Without additional properties, this system creates a new rule, which is not equivalent to any usual rules. 
□ 



3.2.3.2 The finite versions 
Fact 3.2.7 

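(1) $(\mathcal{I}_n) + (eM\mathcal{I}) \Rightarrow (\mathcal{M}^+_n)$,

(2) $(\mathcal{I}_n) + (eM\mathcal{I}) \Rightarrow (CM_n)$,

(3) $(\mathcal{I}_n) + (eM\mathcal{I}) \Rightarrow (OR_n)$.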

Proof 

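(1)

Let $X_1 \subseteq \ldots \subseteq X_n$, so $X_n = X_1 \cup (X_2 - X_1) \cup \ldots \cup (X_n - X_{n-1})$. Let $X_i \in \mathcal{F}(X_{i+1})$, so $X_{i+1} - X_i \in \mathcal{I}(X_{i+1}) \subseteq \mathcal{I}(X_n)$
by $(eM\mathcal{I})$ for $1 \leq i \leq n-1$, so by $(\mathcal{I}_n)$ $X_1 \in \mathcal{M}^+(X_n)$.

(2)

Suppose $\alpha \mid\sim \beta_1, \ldots, \alpha \mid\sim \beta_{n-1}$, but $\alpha \wedge \beta_1 \wedge \ldots \wedge \beta_{n-2} \mid\sim \neg\beta_{n-1}$. Then $M(\alpha \wedge \neg\beta_1), \ldots, M(\alpha \wedge \neg\beta_{n-1}) \in \mathcal{I}(M(\alpha))$, and
$M(\alpha \wedge \beta_1 \wedge \ldots \wedge \beta_{n-2} \wedge \beta_{n-1}) \in \mathcal{I}(M(\alpha \wedge \beta_1 \wedge \ldots \wedge \beta_{n-2})) \subseteq \mathcal{I}(M(\alpha))$ by $(eM\mathcal{I})$. But $M(\alpha) = M(\alpha \wedge \neg\beta_1) \cup \ldots \cup M(\alpha \wedge
\neg\beta_{n-1}) \cup M(\alpha \wedge \beta_1 \wedge \ldots \wedge \beta_{n-2} \wedge \beta_{n-1})$ is now the union of $n$ small subsets, contradiction.

(3)

Let $\alpha_1 \mid\sim \beta, \ldots, \alpha_{n-1} \mid\sim \beta$, so $M(\alpha_i \wedge \neg\beta) \in \mathcal{I}(M(\alpha_i))$ for $1 \leq i \leq n-1$, so $M(\alpha_i \wedge \neg\beta) \in \mathcal{I}(M(\alpha_1 \vee \ldots \vee \alpha_{n-1}))$
for $1 \leq i \leq n-1$ by $(eM\mathcal{I})$, so $M((\alpha_1 \vee \ldots \vee \alpha_{n-1}) \wedge \beta) = M(\alpha_1 \vee \ldots \vee \alpha_{n-1}) - \bigcup\{M(\alpha_i \wedge \neg\beta) : 1 \leq i \leq n-1\} \notin
\mathcal{I}(M(\alpha_1 \vee \ldots \vee \alpha_{n-1}))$ by $(\mathcal{I}_n)$, so $\alpha_1 \vee \ldots \vee \alpha_{n-1} \not\mid\sim \neg\beta$.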

□ 

In the following example, $(OR_n)$, $(\mathcal{M}^+_n)$, $(CM_n)$ hold, but $(\mathcal{I}_n)$ fails, so by Fact 3.2.7 $(\mathcal{I}_n)$ is strictly stronger
than $(OR_n)$, $(\mathcal{M}^+_n)$, $(CM_n)$.

Example 3.2.1 

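Let $n \geq 3$.

Consider $X := \{1, \ldots, n\}$, $\mathcal{Y} := \mathcal{P}(X) - \{\emptyset\}$, $\mathcal{I}(X) := \{\emptyset\} \cup \{\{i\} : 1 \leq i \leq n\}$, and for all $Y \subset X$ $\mathcal{I}(Y) := \{\emptyset\}$.
$(Opt)$, $(iM)$, $(eM\mathcal{I})$, $(eM\mathcal{F})$ (by Fact 3.2.2), $(1*s)$, $(2*s)$ hold, $(\mathcal{I}_n)$ fails, of course.

(1) $(OR_n)$ holds:

Suppose $\alpha_1 \mid\sim \beta, \ldots, \alpha_{n-1} \mid\sim \beta$, $\alpha_1 \vee \ldots \vee \alpha_{n-1} \mid\sim \neg\beta$.

Case 1: $\alpha_1 \vee \ldots \vee \alpha_{n-1} \vdash \neg\beta$, then for all $i$ $\alpha_i \vdash \neg\beta$, so for no $i$ $\alpha_i \mid\sim \beta$ by $(1*s)$ and thus $(AND_1)$, contradiction.

Case 2: $\alpha_1 \vee \ldots \vee \alpha_{n-1} \nvdash \neg\beta$, then $M(\alpha_1 \vee \ldots \vee \alpha_{n-1}) = X$, and there is exactly 1 $k \in X$ s.t. $k \models \beta$. Fix this $k$. By
prerequisite, $\alpha_i \mid\sim \beta$. If $M(\alpha_i) = X$, $\alpha_i \vdash \beta$ cannot be, so there must be exactly 1 $k'$ s.t. $k' \models \neg\beta$, but $card(X) \geq 3$,
contradiction. So $M(\alpha_i) \subset X$, and $\alpha_i \vdash \beta$, so $M(\alpha_i) = \emptyset$ or $M(\alpha_i) = \{k\}$ for all $i$, so $M(\alpha_1 \vee \ldots \vee \alpha_{n-1}) \neq X$, contradiction.

(2) $(\mathcal{M}^+_n)$ holds:

$(\mathcal{M}^+_n)$ is a consequence of $(\mathcal{M}^+_\omega)(3)$, so it suffices to show that the latter holds. Let $X_1 \in \mathcal{F}(X_2)$, $X_2 \in \mathcal{F}(X_3)$. Then
$X_1 = X_2$ or $X_2 = X_3$, so the result is trivial.

(3) $(CM_n)$ holds:

Suppose $\alpha \mid\sim \beta_1, \ldots, \alpha \mid\sim \beta_{n-1}$, $\alpha \wedge \beta_1 \wedge \ldots \wedge \beta_{n-2} \mid\sim \neg\beta_{n-1}$.

Case 1: For all $i$, $1 \leq i \leq n-2$, $\alpha \vdash \beta_i$, then $M(\alpha \wedge \beta_1 \wedge \ldots \wedge \beta_{n-2}) = M(\alpha)$, so $\alpha \mid\sim \beta_{n-1}$ and $\alpha \mid\sim \neg\beta_{n-1}$, contradiction.

Case 2: There is $i$, $1 \leq i \leq n-2$, $\alpha \nvdash \beta_i$, then $M(\alpha) = X$, $M(\alpha \wedge \beta_1 \wedge \ldots \wedge \beta_{n-2}) \subset M(\alpha)$, so $\alpha \wedge \beta_1 \wedge \ldots \wedge \beta_{n-2} \vdash \neg\beta_{n-1}$.

$card(M(\alpha \wedge \beta_1 \wedge \ldots \wedge \beta_{n-2})) \geq n - (n-2) = 2$, so $card(M(\neg\beta_{n-1})) \geq 2$, so $\alpha \not\mid\sim \beta_{n-1}$, contradiction.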

□ 
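Proof

"$\Rightarrow$"

Suppose all sets are definable.

Let $A, B \in \mathcal{I}(X)$, $X = M(\alpha)$, $A = M(\alpha \wedge \neg\beta)$, $B = M(\alpha \wedge \neg\beta')$, so $\alpha \mid\sim \beta$, $\alpha \mid\sim \beta'$, so by $(CM_\omega)$ $\alpha \wedge \beta' \mid\sim \beta$, so
$A - B = M(\alpha \wedge \beta' \wedge \neg\beta) \in \mathcal{I}(M(\alpha \wedge \beta')) = \mathcal{I}(X - B)$.

"$\Leftarrow$"

Let $\alpha \mid\sim \beta$, $\alpha \mid\sim \beta'$, so $M(\alpha \wedge \neg\beta) \in \mathcal{I}(M(\alpha))$, $M(\alpha \wedge \neg\beta') \in \mathcal{I}(M(\alpha))$, so by prerequisite $M(\alpha \wedge \neg\beta) - M(\alpha \wedge \neg\beta') =
M(\alpha \wedge \beta' \wedge \neg\beta) \in \mathcal{I}(M(\alpha) - M(\alpha \wedge \neg\beta')) = \mathcal{I}(M(\alpha \wedge \beta'))$, so $\alpha \wedge \beta' \mid\sim \beta$.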

□ 



Fact 3.2.9 

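(1) $(\mathcal{I}_\omega) + (eM\mathcal{I}) \Rightarrow (OR_\omega)$,

(2) $(\mathcal{I}_\omega) + (eM\mathcal{I}) \Rightarrow (\mathcal{M}^+_\omega)(1)$,

(3) $(\mathcal{I}_\omega) + (eM\mathcal{F}) \Rightarrow (\mathcal{M}^+_\omega)(2)$,

(4) $(\mathcal{I}_\omega) + (eM\mathcal{I}) \Rightarrow (\mathcal{M}^+_\omega)(3)$,

(5) $(\mathcal{I}_\omega) + (eM\mathcal{F}) \Rightarrow (\mathcal{M}^+_\omega)(4)$ (and thus, by Fact 3.2.8, $(CM_\omega)$).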

Proof 

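(1)

Let $\alpha \mid\sim \beta$, $\alpha' \mid\sim \beta$ $\Rightarrow$ $M(\alpha \wedge \neg\beta) \in \mathcal{I}(M(\alpha))$, $M(\alpha' \wedge \neg\beta) \in \mathcal{I}(M(\alpha'))$, so by $(eM\mathcal{I})$ $M(\alpha \wedge \neg\beta) \in \mathcal{I}(M(\alpha \vee \alpha'))$,
$M(\alpha' \wedge \neg\beta) \in \mathcal{I}(M(\alpha \vee \alpha'))$, so $M((\alpha \vee \alpha') \wedge \neg\beta) \in \mathcal{I}(M(\alpha \vee \alpha'))$ by $(\mathcal{I}_\omega)$, so $\alpha \vee \alpha' \mid\sim \beta$.

(2)

Let $A \subseteq X \subseteq Y$, $A \in \mathcal{I}(Y)$, $X - A \in \mathcal{I}(X) \subseteq \mathcal{I}(Y)$ by $(eM\mathcal{I})$ $\Rightarrow$ $X = (X - A) \cup A \in \mathcal{I}(Y)$ by $(\mathcal{I}_\omega)$.

(3)

Let $A \subseteq X \subseteq Y$, let $A \in \mathcal{I}(Y)$, $Y - X \in \mathcal{I}(Y)$ $\Rightarrow$ $A \cup (Y - X) \in \mathcal{I}(Y)$ by $(\mathcal{I}_\omega)$ $\Rightarrow$ $X - A = Y - (A \cup (Y - X)) \in \mathcal{F}(Y)$ $\Rightarrow$
$X - A \in \mathcal{F}(X)$ by $(eM\mathcal{F})$.

(4)

Let $A \subseteq X \subseteq Y$, $A \in \mathcal{F}(X)$, $X \in \mathcal{F}(Y)$, so $Y - X \in \mathcal{I}(Y)$, $X - A \in \mathcal{I}(X) \subseteq \mathcal{I}(Y)$ by $(eM\mathcal{I})$ $\Rightarrow$ $Y - A = (Y - X) \cup (X - A) \in \mathcal{I}(Y)$
by $(\mathcal{I}_\omega)$ $\Rightarrow$ $A \in \mathcal{F}(Y)$.

(5)

Let $A, B \subseteq X$, $A, B \in \mathcal{I}(X)$ $\Rightarrow$ $A \cup B \in \mathcal{I}(X)$ by $(\mathcal{I}_\omega)$ $\Rightarrow$ $X - (A \cup B) \in \mathcal{F}(X)$, but $X - (A \cup B) \subseteq X - B$, so $X - (A \cup B) \in \mathcal{F}(X - B)$
by $(eM\mathcal{F})$, so $A - B = (X - B) - (X - (A \cup B)) \in \mathcal{I}(X - B)$.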

□ 



We give three examples of independence of the various versions of $(\mathcal{M}^+_\omega)$.
Example 3.2.2

All numbers refer to the versions of $(\mathcal{M}^+_\omega)$.
For easier reading, we re-write for $A \subseteq X \subseteq Y$
$(\mathcal{M}^+_\omega)(1)$ : $A \in \mathcal{F}(X)$, $A \in \mathcal{I}(Y)$ $\Rightarrow$ $X \in \mathcal{I}(Y)$,
$(\mathcal{M}^+_\omega)(2)$ : $X \in \mathcal{F}(Y)$, $A \in \mathcal{I}(Y)$ $\Rightarrow$ $A \in \mathcal{I}(X)$.

We give three examples. Investigating all possibilities exhaustively seems quite tedious, and might best be done with the
help of a computer. Fact 3.2.2 will be used repeatedly.
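The three examples that follow are small enough to be checked by brute force. The sketch below does this under stated assumptions: versions (1) and (2) are taken as re-written above, versions (3) ($A \in \mathcal{F}(X)$, $X \in \mathcal{F}(Y)$ $\Rightarrow$ $A \in \mathcal{F}(Y)$) and (4) ($A, B \in \mathcal{I}(X)$ $\Rightarrow$ $A - B \in \mathcal{I}(X - B)$) are read off from the proofs of Fact 3.2.9, and all function names are illustrative.

```python
from itertools import combinations

def nonempty_subsets(base):
    """All nonempty subsets of base, as frozensets."""
    items = sorted(base)
    return [frozenset(c) for r in range(1, len(items) + 1)
            for c in combinations(items, r)]

def make_system(base, nontrivial_big):
    """Build F and I on P(base) - {emptyset}; F(Z) = {Z} unless listed."""
    domain = nonempty_subsets(base)
    F = {Z: {Z} for Z in domain}
    for Z, family in nontrivial_big.items():
        F[frozenset(Z)] = {frozenset(A) for A in family}
    I = {Z: {Z - A for A in F[Z]} for Z in domain}   # small = complement of big
    return domain, F, I

def check_versions(domain, F, I):
    """Return the truth values of versions (1)-(4) as a tuple."""
    chains = [(X, Y) for Y in domain for X in domain if X <= Y]
    v1 = all(X in I[Y] for X, Y in chains for A in F[X] if A in I[Y])
    v2 = all(A in I[X] for X, Y in chains if X in F[Y]
             for A in I[Y] if A <= X)
    v3 = all(A in F[Y] for X, Y in chains if X in F[Y] for A in F[X])
    v4 = all((A - B) in I[X - B] for X in domain
             for A in I[X] for B in I[X] if B != X)
    return v1, v2, v3, v4

# Sets are written as strings of their elements, e.g. "ac" for {a, c}.
first = check_versions(*make_system(
    "abc", {"abc": ["ac", "bc", "abc"], "ab": ["a", "ab"]}))
second = check_versions(*make_system(
    "abc", {"abc": ["ab", "ac", "abc"], "ab": ["a", "ab"]}))
third = check_versions(*make_system(
    "abc", {"abc": ["ab", "ac", "abc"], "ab": ["a", "ab"], "ac": ["a", "ac"]}))
```

Each result tuple lists versions (1) to (4) in order, so it can be compared directly with the "fail"/"holds" headings of the three examples.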

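• (1), (2), (4) fail, (3) holds:

Let $Y := \{a, b, c\}$, $\mathcal{Y} := \mathcal{P}(Y) - \{\emptyset\}$, $\mathcal{F}(Y) := \{\{a, c\}, \{b, c\}, Y\}$.

Let $X := \{a, b\}$, $\mathcal{F}(X) := \{\{a\}, X\}$, $A := \{a\}$, and $\mathcal{F}(Z) := \{Z\}$ for all $Z \neq X, Y$.

$(Opt)$, $(iM)$, $(eM\mathcal{I})$, $(eM\mathcal{F})$ hold, $(\mathcal{I}_\omega)$ fails, of course.

(1) fails: $A \in \mathcal{F}(X)$, $A \in \mathcal{I}(Y)$, $X \notin \mathcal{I}(Y)$.

(2) fails: $\{a, c\} \in \mathcal{F}(Y)$, $\{a\} \in \mathcal{I}(Y)$, but $\{a\} \notin \mathcal{I}(\{a, c\})$.

(3) holds: If $X_1 \in \mathcal{F}(X_2)$, $X_2 \in \mathcal{F}(X_3)$, then $X_1 = X_2$ or $X_2 = X_3$, so (3) holds trivially (note that $X \notin \mathcal{F}(Y)$).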
• (2), (3), (4) fail, (1) holds:

Let $Y := \{a, b, c\}$, $\mathcal{Y} := \mathcal{P}(Y) - \{\emptyset\}$, $\mathcal{F}(Y) := \{\{a, b\}, \{a, c\}, Y\}$.

Let $X := \{a, b\}$, $\mathcal{F}(X) := \{\{a\}, X\}$, and $\mathcal{F}(Z) := \{Z\}$ for all $Z \neq X, Y$.

$(Opt)$, $(iM)$, $(eM\mathcal{I})$, $(eM\mathcal{F})$ hold, $(\mathcal{I}_\omega)$ fails, of course.

(1) holds:

Let $X_1 \in \mathcal{F}(X_2)$, $X_1 \in \mathcal{I}(X_3)$, we have to show $X_2 \in \mathcal{I}(X_3)$. If $X_1 = X_2$, then this is trivial. Consider $X_1 \in \mathcal{F}(X_2)$.
If $X_1 \neq X_2$, then $X_1$ has to be $\{a\}$ or $\{a, b\}$ or $\{a, c\}$. But none of these are in $\mathcal{I}(X_3)$ for any $X_3$, so the implication
is trivially true.

(2) fails: $\{a, c\} \in \mathcal{F}(Y)$, $\{c\} \in \mathcal{I}(Y)$, $\{c\} \notin \mathcal{I}(\{a, c\})$.

(3) fails: $\{a\} \in \mathcal{F}(X)$, $X \in \mathcal{F}(Y)$, $\{a\} \notin \mathcal{F}(Y)$.

(4) fails: $\{b\}, \{c\} \in \mathcal{I}(Y)$, $\{c\} \notin \mathcal{I}(Y - \{b\}) = \mathcal{I}(\{a, c\}) = \{\emptyset\}$.

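• (1), (2), (4) hold, (3) fails:

Let $Y := \{a, b, c\}$, $\mathcal{Y} := \mathcal{P}(Y) - \{\emptyset\}$, $\mathcal{F}(Y) := \{\{a, b\}, \{a, c\}, Y\}$.

Let $\mathcal{F}(\{a, b\}) := \{\{a\}, \{a, b\}\}$, $\mathcal{F}(\{a, c\}) := \{\{a\}, \{a, c\}\}$, and $\mathcal{F}(Z) := \{Z\}$ for all other $Z$.
$(Opt)$, $(iM)$, $(eM\mathcal{I})$, $(eM\mathcal{F})$ hold, $(\mathcal{I}_\omega)$ fails, of course.

(1) holds:

Let $X_1 \in \mathcal{F}(X_2)$, $X_1 \in \mathcal{I}(X_3)$, we have to show $X_2 \in \mathcal{I}(X_3)$. Consider $X_1 \in \mathcal{I}(X_3)$. If $X_1 = X_2$, this is trivial. If
$\emptyset \neq X_1 \in \mathcal{I}(X_3)$, then $X_1 = \{b\}$ or $X_1 = \{c\}$, but then by $X_1 \in \mathcal{F}(X_2)$ $X_2$ has to be $\{b\}$, or $\{c\}$, so $X_1 = X_2$.

(2) holds: Let $X_1 \subseteq X_2 \subseteq X_3$, let $X_2 \in \mathcal{F}(X_3)$, $X_1 \in \mathcal{I}(X_3)$, we have to show $X_1 \in \mathcal{I}(X_2)$. If $X_1 = \emptyset$, this is trivial,
likewise if $X_2 = X_3$. Otherwise $X_1 = \{b\}$ or $X_1 = \{c\}$, and $X_3 = Y$. If $X_1 = \{b\}$, then $X_2 = \{a, b\}$, and the condition
holds, likewise if $X_1 = \{c\}$, then $X_2 = \{a, c\}$, and it holds again.

(3) fails: $\{a\} \in \mathcal{F}(\{a, c\})$, $\{a, c\} \in \mathcal{F}(Y)$, $\{a\} \notin \mathcal{F}(Y)$.

(4) holds:

If $A, B \in \mathcal{I}(X)$, and $A \neq B$, $A, B \neq \emptyset$, then $X = Y$ and e.g. $A = \{c\}$, $B = \{b\}$, and $\{c\} \in \mathcal{I}(Y - \{b\}) = \mathcal{I}(\{a, c\})$.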

□ 



3.2.3.4 Rational Monotony 
Fact 3.2.10 

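The three versions of $(\mathcal{M}^{++})$ are equivalent.

(We assume closure of the domain under set difference. For the third version of $(\mathcal{M}^{++})$, we use $(iM)$.)
Proof

For (1) and (2), we have $A, B \subseteq X$, for (3) we have $A \subseteq X \subseteq Y$. For $A, B \subseteq X$, $(X - B) - ((X - A) - B) = A - B$ holds.

(1) $\Rightarrow$ (2) : Let $A \in \mathcal{F}(X)$, $B \notin \mathcal{F}(X)$, so $X - A \in \mathcal{I}(X)$, so by prerequisite $(X - A) - B \in \mathcal{I}(X - B)$, so $A - B =
(X - B) - ((X - A) - B) \in \mathcal{F}(X - B)$.

(2) $\Rightarrow$ (1) : Let $A \in \mathcal{I}(X)$, $B \notin \mathcal{F}(X)$, so $X - A \in \mathcal{F}(X)$, so by prerequisite $(X - A) - B \in \mathcal{F}(X - B)$, so $A - B =
(X - B) - ((X - A) - B) \in \mathcal{I}(X - B)$.

(1) $\Rightarrow$ (3) :

Suppose $A \notin \mathcal{M}^+(Y)$, but $X \in \mathcal{M}^+(Y)$, we show $A \notin \mathcal{M}^+(X)$. So $A \in \mathcal{I}(Y)$, $Y - X \notin \mathcal{F}(Y)$, so by (1) $A = A - (Y - X) \in
\mathcal{I}(Y - (Y - X)) = \mathcal{I}(X)$.

(3) $\Rightarrow$ (1) :

Suppose $A - B \notin \mathcal{I}(X - B)$, $B \notin \mathcal{F}(X)$, we show $A \notin \mathcal{I}(X)$. By prerequisite $A - B \in \mathcal{M}^+(X - B)$, $X - B \in \mathcal{M}^+(X)$, so by
(3) $A - B \in \mathcal{M}^+(X)$, so by $(iM)$ $A \in \mathcal{M}^+(X)$, so $A \notin \mathcal{I}(X)$.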

□ 



Fact 3.2.11 

We assume that all sets are definable by a formula. 
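Proof

We show equivalence of $(RatM)$ with version (1) of $(\mathcal{M}^{++})$.

"$\Rightarrow$"

We have $A, B \subseteq X$, so we can write $X = M(\phi)$, $A = M(\phi \wedge \neg\psi)$, $B = M(\phi \wedge \neg\psi')$. $A \in \mathcal{I}(X)$, $B \notin \mathcal{F}(X)$, so $\phi \mid\sim \psi$,
$\phi \not\mid\sim \neg\psi'$, so by $(RatM)$ $\phi \wedge \psi' \mid\sim \psi$, so $A - B = M(\phi \wedge \neg\psi) - M(\phi \wedge \neg\psi') = M(\phi \wedge \psi' \wedge \neg\psi) \in \mathcal{I}(M(\phi \wedge \psi')) = \mathcal{I}(X - B)$.

"$\Leftarrow$"

Let $\phi \mid\sim \psi$, $\phi \not\mid\sim \neg\psi'$, so $M(\phi \wedge \neg\psi) \in \mathcal{I}(M(\phi))$, $M(\phi \wedge \neg\psi') \notin \mathcal{F}(M(\phi))$, so by $(\mathcal{M}^{++})(1)$ $M(\phi \wedge \psi' \wedge \neg\psi) =
M(\phi \wedge \neg\psi) - M(\phi \wedge \neg\psi') \in \mathcal{I}(M(\phi \wedge \psi'))$, so $\phi \wedge \psi' \mid\sim \psi$.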

□ 



3.2.4 Size and principal filter logic 

The connection with logical rules was shown in the table of Definition 2.3.

(1) to (7) of the following proposition (in different notation, as the more systematic connections were found only afterwards)
were already published in [GS08c]. We give the proposition here in totality to complete the picture.

Proposition 3.2.12 

If $f(X)$ is the smallest $A$ s.t. $A \in \mathcal{F}(X)$, then, given the property on the left, the one on the right follows.

Conversely, when we define $\mathcal{F}(X) := \{X' : f(X) \subseteq X' \subseteq X\}$, given the property on the right, the one on the left follows.
For this direction, we assume that we can use the full powerset of some base set U - as is the case for the model sets of a 
finite language. This is perhaps not too bold, as we mainly want to stress here the intuitive connections, without putting 
too much weight on definability questions. 

We assume (iM) to hold. 



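(1.1)    $(eM\mathcal{I})$                                                  $\Rightarrow$       $(\mu wOR)$
(1.2)                                                              $\Leftarrow$
(2.1)    $(eM\mathcal{I}) + (\mathcal{I}_\omega)$                          $\Rightarrow$       $(\mu OR)$
(2.2)                                                              $\Leftarrow$
(3.1)    $(eM\mathcal{I}) + (\mathcal{I}_\omega)$                          $\Rightarrow$       $(\mu PR)$
(3.2)                                                              $\Leftarrow$
(4.1)    $(\mathcal{I} \cup disj)$                                         $\Rightarrow$       $(\mu disjOR)$
(4.2)                                                              $\Leftarrow$
(5.1)    $(\mathcal{M}^+_\omega)(4)$                                       $\Rightarrow$       $(\mu CM)$
(5.2)                                                              $\Leftarrow$
(6.1)    $(\mathcal{M}^{++})$                                              $\Rightarrow$       $(\mu RatM)$
(6.2)                                                              $\Leftarrow$
(7.1)    $(\mathcal{I}_\omega)$                                            $\Rightarrow$       $(\mu AND)$
(7.2)                                                              $\Leftarrow$
(8.1)    $(eM\mathcal{I}) + (\mathcal{I}_\omega)$                          $\Rightarrow$       $(\mu CUT)$
(8.2)                                                              $\not\Leftarrow$
(9.1)    $(eM\mathcal{I}) + (\mathcal{I}_\omega) + (\mathcal{M}^+_\omega)(4)$   $\Rightarrow$       $(\mu CUM)$
(9.2)                                                              $\not\Leftarrow$
(10.1)   $(eM\mathcal{I}) + (\mathcal{I}_\omega) + (eM\mathcal{F})$        $\Rightarrow$       $(\mu \subseteq \supseteq)$
(10.2)                                                             $\not\Leftarrow$
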


Note that there is no $(\mu wCM)$, as the conditions $(\mu \ldots)$ imply that the filter is principal, and thus that $(\mathcal{I}_\omega)$ holds - we
cannot "see" $(wCM)$ alone with principal filters.
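The converse direction of the proposition starts from a choice function $f$ and defines the principal filter $\mathcal{F}(X) := \{X' : f(X) \subseteq X' \subseteq X\}$. A minimal sketch of this construction on a three-element base set follows. The function $f$ used here (minimal elements under one preference pair) and all identifiers are illustrative assumptions; the conditions are taken in the forms used in the proof: $(\mu PR)$ as $X \subseteq Y \Rightarrow f(Y) \cap X \subseteq f(X)$, $(eM\mathcal{I})$ as $X \subseteq Y \Rightarrow \mathcal{I}(X) \subseteq \mathcal{I}(Y)$, and $(\mathcal{I}_\omega)$ as closure of $\mathcal{I}(X)$ under finite unions.

```python
from itertools import combinations

def subsets(base):
    """All subsets of base (including the empty set), as frozensets."""
    items = sorted(base)
    return [frozenset(c) for r in range(len(items) + 1)
            for c in combinations(items, r)]

def minimal(X, smaller):
    """Illustrative f(X): elements of X with no smaller element in X."""
    return frozenset(x for x in X if not any((y, x) in smaller for y in X))

U = frozenset({0, 1, 2})
smaller = {(0, 1)}                 # assumed preference: 0 is smaller than 1
domain = subsets(U)
f = {X: minimal(X, smaller) for X in domain}

# Principal filter generated by f(X), and the dual ideal of small sets.
F = {X: {A for A in domain if f[X] <= A <= X} for X in domain}
I = {X: {X - A for A in F[X]} for X in domain}

mu_pr = all(f[Y] & X <= f[X] for X in domain for Y in domain if X <= Y)
e_m_i = all(I[X] <= I[Y] for X in domain for Y in domain if X <= Y)
i_omega = all((A | B) in I[X] for X in domain for A in I[X] for B in I[X])
```

On this instance all three checks succeed, in line with row (3.2) of the table: $(\mu PR)$ for $f$ yields $(eM\mathcal{I})$ and $(\mathcal{I}_\omega)$ for the generated filters.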



Proof 

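(1.1) $(eM\mathcal{I}) \Rightarrow (\mu wOR)$ :

$X - f(X)$ is small in $X$, so it is small in $X \cup Y$ by $(eM\mathcal{I})$, so $A := X \cup Y - (X - f(X)) \in \mathcal{F}(X \cup Y)$, but $A \subseteq f(X) \cup Y$,
and $f(X \cup Y)$ is the smallest element of $\mathcal{F}(X \cup Y)$, so $f(X \cup Y) \subseteq A \subseteq f(X) \cup Y$.

(1.2) $(\mu wOR) \Rightarrow (eM\mathcal{I})$ :

Let $X \subseteq Y$, $X' := Y - X$. Let $A \in \mathcal{I}(X)$, so $X - A \in \mathcal{F}(X)$, so $f(X) \subseteq X - A$, so $f(X \cup X') \subseteq f(X) \cup X' \subseteq (X - A) \cup X'$
by prerequisite, so $(X \cup X') - ((X - A) \cup X') = A \in \mathcal{I}(X \cup X')$.

(2.1) $(eM\mathcal{I}) + (\mathcal{I}_\omega) \Rightarrow (\mu OR)$ :

$X - f(X)$ is small in $X$, $Y - f(Y)$ is small in $Y$, so both are small in $X \cup Y$ by $(eM\mathcal{I})$, so $A := (X - f(X)) \cup (Y - f(Y))$
is small in $X \cup Y$ by $(\mathcal{I}_\omega)$, but $X \cup Y - (f(X) \cup f(Y)) \subseteq A$, so $f(X) \cup f(Y) \in \mathcal{F}(X \cup Y)$, so, as $f(X \cup Y)$ is the smallest
element of $\mathcal{F}(X \cup Y)$, $f(X \cup Y) \subseteq f(X) \cup f(Y)$.

(2.2) $(\mu OR) \Rightarrow (eM\mathcal{I}) + (\mathcal{I}_\omega)$ :

Let again $X \subseteq Y$, $X' := Y - X$. Let $A \in \mathcal{I}(X)$, so $X - A \in \mathcal{F}(X)$, so $f(X) \subseteq X - A$. $f(X') \subseteq X'$, so $f(X \cup X') \subseteq
f(X) \cup f(X') \subseteq (X - A) \cup X'$ by prerequisite, so $(X \cup X') - ((X - A) \cup X') = A \in \mathcal{I}(X \cup X')$.

$(\mathcal{I}_\omega)$ holds by definition.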
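
(3.1) $(eM\mathcal{I}) + (\mathcal{I}_\omega) \Rightarrow (\mu PR)$ :

Let $X \subseteq Y$. $Y - f(Y)$ is the largest element of $\mathcal{I}(Y)$, $X - f(X) \in \mathcal{I}(X) \subseteq \mathcal{I}(Y)$ by $(eM\mathcal{I})$, so $(X - f(X)) \cup (Y - f(Y)) \in \mathcal{I}(Y)$
by $(\mathcal{I}_\omega)$, so by "largest" $X - f(X) \subseteq Y - f(Y)$, so $f(Y) \cap X \subseteq f(X)$.

(3.2) $(\mu PR) \Rightarrow (eM\mathcal{I}) + (\mathcal{I}_\omega)$ :

Let again $X \subseteq Y$, $X' := Y - X$. Let $A \in \mathcal{I}(X)$, so $X - A \in \mathcal{F}(X)$, so $f(X) \subseteq X - A$, so by prerequisite $f(Y) \cap X \subseteq X - A$,
so $f(Y) \subseteq X' \cup (X - A)$, so $(X \cup X') - (X' \cup (X - A)) = A \in \mathcal{I}(Y)$.

Again, $(\mathcal{I}_\omega)$ holds by definition.

(4.1) $(\mathcal{I} \cup disj) \Rightarrow (\mu disjOR)$ :

If $X \cap Y = \emptyset$, then (1) $A \in \mathcal{I}(X)$, $B \in \mathcal{I}(Y)$ $\Rightarrow$ $A \cup B \in \mathcal{I}(X \cup Y)$ and (2) $A \in \mathcal{F}(X)$, $B \in \mathcal{F}(Y)$ $\Rightarrow$ $A \cup B \in \mathcal{F}(X \cup Y)$ are
equivalent. (By $X \cap Y = \emptyset$, $(X - A) \cup (Y - B) = (X \cup Y) - (A \cup B)$.) So $f(X) \in \mathcal{F}(X)$, $f(Y) \in \mathcal{F}(Y)$ $\Rightarrow$ (by prerequisite)
$f(X) \cup f(Y) \in \mathcal{F}(X \cup Y)$. $f(X \cup Y)$ is the smallest element of $\mathcal{F}(X \cup Y)$, so $f(X \cup Y) \subseteq f(X) \cup f(Y)$.

(4.2) $(\mu disjOR) \Rightarrow (\mathcal{I} \cup disj)$ :

Let $X \subseteq Y$, $X' := Y - X$. Let $A \in \mathcal{I}(X)$, $A' \in \mathcal{I}(X')$, so $X - A \in \mathcal{F}(X)$, $X' - A' \in \mathcal{F}(X')$, so $f(X) \subseteq X - A$, $f(X') \subseteq X' - A'$,
so $f(X \cup X') \subseteq f(X) \cup f(X') \subseteq (X - A) \cup (X' - A')$ by prerequisite, so $(X \cup X') - ((X - A) \cup (X' - A')) = A \cup A' \in \mathcal{I}(X \cup X')$.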

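(5.1) $(\mathcal{M}^+_\omega) \Rightarrow (\mu CM)$ :

$f(X) \subseteq Y \subseteq X$ $\Rightarrow$ $X - Y \in \mathcal{I}(X)$, $X - f(X) \in \mathcal{I}(X)$ $\Rightarrow$ (by $(\mathcal{M}^+_\omega)(4)$) $A := (X - f(X)) - (X - Y) \in \mathcal{I}(Y)$ $\Rightarrow$
$Y - A = f(X) - (X - Y) \in \mathcal{F}(Y)$ $\Rightarrow$ $f(Y) \subseteq f(X) - (X - Y) \subseteq f(X)$.

(5.2) $(\mu CM) \Rightarrow (\mathcal{M}^+_\omega)$ :

Let $X - A \in \mathcal{I}(X)$, so $A \in \mathcal{F}(X)$, let $B \in \mathcal{I}(X)$, so $f(X) \subseteq X - B \subseteq X$, so by prerequisite $f(X - B) \subseteq f(X)$. As $A \in \mathcal{F}(X)$,
$f(X) \subseteq A$, so $f(X - B) \subseteq f(X) \subseteq A \cap (X - B) = A - B$, and $A - B \in \mathcal{F}(X - B)$, so $(X - A) - B = X - (A \cup B) =
(X - B) - (A - B) \in \mathcal{I}(X - B)$, so $(\mathcal{M}^+_\omega)(4)$ holds.

(6.1) $(\mathcal{M}^{++}) \Rightarrow (\mu RatM)$ :

Let $X \subseteq Y$, $X \cap f(Y) \neq \emptyset$. If $Y - X \in \mathcal{F}(Y)$, then $A := (Y - X) \cap f(Y) \in \mathcal{F}(Y)$, but by $X \cap f(Y) \neq \emptyset$ $A \subset f(Y)$,
contradicting "smallest" of $f(Y)$. So $Y - X \notin \mathcal{F}(Y)$, and by $(\mathcal{M}^{++})$ $X - f(Y) = (Y - f(Y)) - (Y - X) \in \mathcal{I}(X)$, so
$X \cap f(Y) \in \mathcal{F}(X)$, so $f(X) \subseteq f(Y) \cap X$.

(6.2) $(\mu RatM) \Rightarrow (\mathcal{M}^{++})$ :

Let $A \in \mathcal{F}(Y)$, $B \notin \mathcal{F}(Y)$. $B \notin \mathcal{F}(Y)$ $\Rightarrow$ $Y - B \notin \mathcal{I}(Y)$ $\Rightarrow$ $(Y - B) \cap f(Y) \neq \emptyset$. Set $X := Y - B$, so $X \cap f(Y) \neq \emptyset$, $X \subseteq Y$,
so $f(X) \subseteq f(Y) \cap X$ by prerequisite. $f(Y) \subseteq A$ $\Rightarrow$ $f(X) \subseteq f(Y) \cap X = f(Y) - B \subseteq A - B$.

(7.1) $(\mathcal{I}_\omega) \Rightarrow (\mu AND)$ :
Trivial.

(7.2) $(\mu AND) \Rightarrow (\mathcal{I}_\omega)$ :
Trivial.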

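(8.1) Let $f(X) \subseteq Y \subseteq X$. $Y - f(Y) \in \mathcal{I}(Y) \subseteq \mathcal{I}(X)$ by $(eM\mathcal{I})$. $f(X) \subseteq Y$ $\Rightarrow$ $X - Y \subseteq X - f(X) \in \mathcal{I}(X)$, so by $(iM)$
$X - Y \in \mathcal{I}(X)$. Thus by $(\mathcal{I}_\omega)$ $X - f(Y) = (X - Y) \cup (Y - f(Y)) \in \mathcal{I}(X)$, so $f(Y) \in \mathcal{F}(X)$, so $f(X) \subseteq f(Y)$ by definition.

(8.2) $(\mu CUT)$ is too special to allow us to deduce $(eM\mathcal{I})$. Consider $U := \{a, b, c\}$, $X := \{a, b\}$, $\mathcal{F}(X) = \{X, \{a\}\}$, $\mathcal{F}(Z) = \{Z\}$
for all other $X \neq Z \subseteq U$. Then $(eM\mathcal{I})$ fails, as $\{b\} \in \mathcal{I}(X)$, but $\{b\} \notin \mathcal{I}(U)$. $(iM)$ and $(eM\mathcal{F})$ hold. We have to check
$f(A) \subseteq B \subseteq A$ $\Rightarrow$ $f(A) \subseteq f(B)$. The only case where it might fail is $A = X$, $B = \{a\}$, but it holds there, too.

(9.1) By the fact published as Fact 14 (6) in [GS08c], we have $(\mu CM) + (\mu CUT) \Leftrightarrow (\mu CUM)$, so the result
follows from (5.1) and (8.1).

(9.2) Consider the same example as in (8.2). $f(A) \subseteq B \subseteq A$ $\Rightarrow$ $f(A) = f(B)$ holds there, too, by the same argument as
above.

(10.1) Let $f(X) \subseteq Y$, $f(Y) \subseteq X$. So $f(X), f(Y) \subseteq X \cap Y$, and $X - (X \cap Y) \in \mathcal{I}(X)$, $Y - (X \cap Y) \in \mathcal{I}(Y)$ by $(iM)$. Thus
$f(X), f(Y) \in \mathcal{F}(X \cap Y)$ by $(eM\mathcal{F})$ and $f(X) \cap f(Y) \in \mathcal{F}(X \cap Y)$ by $(\mathcal{I}_\omega)$. So $X \cap Y - (f(X) \cap f(Y)) \in \mathcal{I}(X \cap Y)$, so
$X \cap Y - (f(X) \cap f(Y)) \in \mathcal{I}(X), \mathcal{I}(Y)$ by $(eM\mathcal{I})$, so $(X - (X \cap Y)) \cup (X \cap Y - f(X) \cap f(Y)) = X - f(X) \cap f(Y) \in \mathcal{I}(X)$
by $(\mathcal{I}_\omega)$, so $f(X) \cap f(Y) \in \mathcal{F}(X)$, likewise $f(X) \cap f(Y) \in \mathcal{F}(Y)$, so $f(X) \subseteq f(X) \cap f(Y)$, $f(Y) \subseteq f(X) \cap f(Y)$, and
$f(X) = f(Y)$.

(10.2) Consider again the same example as in (8.2), we have to show that $f(A) \subseteq B$, $f(B) \subseteq A$ $\Rightarrow$ $f(A) = f(B)$. The only
interesting case is when one of $A$, $B$ is $X$, but not both. Let e.g. $A = X$. We then have $f(X) = \{a\}$, $f(B) = B \subseteq X$, and
$f(X) = \{a\} \subseteq B$, so $B = \{a\}$, and the condition holds.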

□ 
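The counterexample used in (8.2), (9.2) and (10.2) is finite, so the claims about it can be checked by enumeration. The sketch below assumes the domain is $\mathcal{P}(U) - \{\emptyset\}$ and reads the three $\mu$-conditions in the forms stated in the proof; the variable names are illustrative.

```python
from itertools import combinations

def nonempty_subsets(base):
    """All nonempty subsets of base, as frozensets."""
    items = sorted(base)
    return [frozenset(c) for r in range(1, len(items) + 1)
            for c in combinations(items, r)]

U = frozenset("abc")
X = frozenset("ab")
domain = nonempty_subsets(U)

# F(X) = {X, {a}}, F(Z) = {Z} for all other Z; I(Z) holds the complements.
F = {Z: {Z} for Z in domain}
F[X] = {X, frozenset("a")}
I = {Z: {Z - A for A in F[Z]} for Z in domain}

# f(Z): smallest element of the (principal) filter F(Z).
f = {Z: frozenset.intersection(*F[Z]) for Z in domain}

e_m_i = all(I[A] <= I[B] for A in domain for B in domain if A <= B)
mu_cut = all(f[A] <= f[B] for A in domain for B in domain if f[A] <= B <= A)
mu_cum = all(f[A] == f[B] for A in domain for B in domain if f[A] <= B <= A)
mu_eq = all(f[A] == f[B] for A in domain for B in domain
            if f[A] <= B and f[B] <= A)
```

Here `e_m_i` comes out false while the three $\mu$-conditions hold, which is exactly why none of them can yield $(eM\mathcal{I})$.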



The product size defined by principal filters is discussed in Section 7.1.2.1.
Chapter 4

Preferential structures - Part I



This chapter, Part I, is dedicated to the basic case without conditions for the domain.

The following chapter, Part II, will treat the case with supplementary conditions for the domain, as well as applications 
and special cases. 

Higher preferential structures will be treated in the next but one chapter. 

4.1 Introduction 

After the present section, we will treat in Section 4.2 the case without conditions on the domain, and in Section 5.3
the case with the usual conditions on the domain, in particular closure under finite unions and finite intersections.

But first, some general remarks.

4.1.1 Remarks on nonmonotonic logics and preferential semantics 

Nonmonotonic logics were, historically, studied from two different points of view: the syntactic side, where rules like (AND),
(CUM) (see Definition 2.3) were postulated for their naturalness in reasoning, and the semantic
side, by the introduction of preferential structures (see Definition 4.1.1 and Definition 4.1.2 below).
The first approach was pursued by Gabbay [Gab85], Makinson [Mak94], and others, the second
by Shoham and others, see [Sho87b], [BS85]. Both approaches were brought together by Kraus, Lehmann, Magidor and
others, see [KLM90], [LM92], in their completeness results.

A preferential structure $\mathcal{M}$ defines a logic $\mid\sim$ by $T \mid\sim \phi$ iff $\phi$ holds in all $\mathcal{M}$-minimal models of $T$. This is made precise in
Definition 4.1.1 and Definition 4.1.2 below. At the same time, $\mathcal{M}$ defines also a model set function,
by assigning to the set of models of $T$ the set of its minimal models. As logics can speak only about definable model
sets (here the model set defined by $T$), $\mathcal{M}$ defines a function from the definable sets of models to arbitrary model sets:
$\mu_{\mathcal{M}} : D(\mathcal{L}) \rightarrow \mathcal{P}(M(\mathcal{L}))$. This is the general framework, within which we will work most of the time. Different logics and
situations (see e.g. Plausibility Logic, but also update situations, etc.) will
force us to generalize; we then consider functions $f : \mathcal{Y} \rightarrow \mathcal{P}(W)$, where $W$ is an arbitrary set, and $\mathcal{Y} \subseteq \mathcal{P}(W)$.
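The definition "$T \mid\sim \phi$ iff $\phi$ holds in all minimal models of $T$" can be illustrated on a two-atom language. The following is a minimal sketch, not the book's construction: the atoms, the preference relation (models satisfying "birds fly" are preferred) and all function names are assumptions chosen for illustration, and formulas are represented as Python predicates on models.

```python
from itertools import product

ATOMS = ("bird", "flies")
# A model is the set of atoms it makes true.
MODELS = [frozenset(a for a, v in zip(ATOMS, values) if v)
          for values in product([False, True], repeat=len(ATOMS))]

def normal(m):
    """Assumed normality condition: bird -> flies."""
    return "bird" not in m or "flies" in m

def smaller(m1, m2):
    """m1 is preferred to m2 iff m1 is normal and m2 is not."""
    return normal(m1) and not normal(m2)

def models_of(theory):
    """Classical models of a list of formulas (predicates on models)."""
    return {m for m in MODELS if all(phi(m) for phi in theory)}

def mu(X):
    """Minimal models of X: those with no preferred model inside X."""
    return {m for m in X if not any(smaller(m2, m) for m2 in X)}

def pref_entails(theory, phi):
    """T |~ phi iff phi holds in all minimal models of T."""
    return all(phi(m) for m in mu(models_of(theory)))

bird = lambda m: "bird" in m
flies = lambda m: "flies" in m
not_flies = lambda m: "flies" not in m

defeasible = pref_entails([bird], flies)                  # bird |~ flies
classical = all(flies(m) for m in models_of([bird]))      # bird |- flies ?
after_more_info = pref_entails([bird, not_flies], flies)  # conclusion is withdrawn
```

The conclusion `flies` follows preferentially from `bird` although it does not follow classically, and it is lost again once `not_flies` is added, which is the nonmonotonic behaviour the model set function captures.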

This Chapter is about representation proofs in the realm of preferential and related structures and concerns mainly the 
following points: 

(1) the importance of closure properties of the domain, in particular under finite unions, 

(2) the conditions affected by lack of definability preservation, 

(3) the limit version of preferential structures, 

(4) the problems and solutions for "hidden dimensions" , i.e. dimensions not directly observable. 

Concerning (1), the main new result is probably the fact that, in the absence of closure under finite unions, Cumulativity 
fans out to an infinity of different conditions. We also separate here clearly the main proof technique from simplifications 
possible in the case of closure under finite unions. 

Concerning (2), we examine in a systematic way conditions affected by absence of definability preservation, and use now 
more carefully crafted proofs which can be used in the cases with and those without definability preservation, achieving 
thus a simplification and a conceptually clearer approach. 

Concerning (3), we introduce the concept of an algebraic limit, to separate logical problems (i.e. due to lack of definability 
preservation) from algebraic ones of the limit variant. Again, this results in a better insight into problems and solutions of 
this variant. 

Concerning (4), we describe a problem common to several representation questions, where we cannot directly observe all 
dimensions of a result. 

Conceptually, one can subsume (2) and (4) under a more general notion of "blurred observation" , but the solutions appear 
to be sufficiently different to merit a separate treatment - at least at the present stage of investigation. 
and avoids such losses, or, at least, reveals them clearly. E.g., ignorance is lost when we complete in arbitrary manner
a partial order to a complete one, or when we choose arbitrarily some copy to be smaller than suitable other ones. The
authors think that this, perhaps seemingly pedantic, attention will reveal itself as fruitful in future work.

The present text is a continuation of the second author's [Sch04]. Many results presented here were already shown in [Sch04],
but in a less systematic, more ad hoc way. The emphasis of this book is on describing the problems and machinery "behind
the scene" , on a clear separation of general problems and possible simplifications. In particular, we take a systematic 
look at closure of the domain (mainly under finite unions), conditions affected by lack of definability preservation, and an 
analysis of the limit variant, leading to the notion of an algebraic limit. 

Perhaps the single most important new result is the fact that Cumulativity fans out to an infinity of different conditions
in the absence of closure under finite unions (see Example 4.2.4).

The systematic investigation of $H(U, u)$ and $H(U)$, see Definition 4.2.4, is also new.

Many proofs have been systematized, prerequisites have been better isolated, and the results are now more general, so they 
can be re-used more easily. In particular, we often first look at the case without closure under finite union, and obtain in 
a second step the case with this closure as a simplification of the former. 

The cases of "hidden dimensions" are now looked at in a more systematic way, and infinite variants are discussed and 
solved for the first time - to the authors' knowledge. In particular, such hidden dimensions are present in update situations, 
where we only observe the outcome, and can conclude about starting and intermediate steps only indirectly. We will see 
that this question leads to an (it seems) non-trivial problem, and how to circumvent it in a natural way, using the limit 
approach. 

The separation of the limit variant into structural limit, algebraic limit, and logical limit allows us to better identify 
problems, and also see that and how a number of problems here are just those arising also from lack of definability 
preservation. 

Finally, we solve some (to the authors' knowledge) open representation problems: Problems around Aqvist's deontic logic, 
and Booth's revision approach - see Section [7^1 (page [T55]) and Section f8. 2. 21 (page [TB5|) . 

The core of the completeness proofs consists in a general proof strategy, which is, essentially, a mathematical reformulation
of the things we have to do. For instance, in general preferential structures, if $x \in X - \mu(X)$, we need to "minimize" $x$ by
some other element. Yet, we do not know by which one. This leads in a natural way to consider copies of $x$, one for each
$x' \in X$, and to minimize such $\langle x, x' \rangle$ by $x'$ - somehow. Essentially, this is all there is to do in the most general case. As
$x$ may be in several $X - \mu(X)$, we have to do the construction for all such $X$ simultaneously, leading us to consider the
product and choice functions. This is the basic proof construction - the mathematical counterpart of the representation
idea. Of course, the same has to be done for above $x'$, so we will have $\langle x, f \rangle$, $\langle x', f' \rangle$, etc. Now, there is a problem: do
we make all such $\langle x', f' \rangle$ smaller than the $\langle x, f \rangle$, only one, some of them? We simply do not know, they all give the same
result in the basic construction, as we are basically interested only in the first coordinate - and will see in the outcome
only this. Choosing one possibility is completely arbitrary, and such a choice is a loss of ignorance. An ideal proof should
not do this, it should not commit beyond the necessary, it should "preserve ignorance". There is an easy way out: we
just construct for all possibilities one structure - just as a classical theory may have more than one model. This inflation
of structures is the honest solution. Of course, once we are aware of the problem, we can construct just one structure,
but should remember the arbitrariness of our decision. This attention is not purely academic or pedantic, as we will see
immediately when we try to make the construction transitive. If we make all copies of $x'$ smaller than $\langle x, f \rangle$, then the
structure cannot be made transitive by simple closure: we did not pay enough attention to what can happen "in future".
The solution is not to consider simple functions for copies, but trees of elements and functions, giving us complete control.
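The basic construction described above (copies $\langle x, f \rangle$ with $f$ a choice function over all $X$ with $x \in X - \mu(X)$) can be sketched on a small finite example. This is only an illustration under stated assumptions: it makes all copies of every $x' \in ran(f)$ smaller than $\langle x, f \rangle$, which is one of the arbitrary choices the text warns about; it stores only the range of $f$; and the example function `mu` (where $c$ is minimized in $\{a, b, c\}$ but survives in $\{a, c\}$ and $\{b, c\}$) and all identifiers are hypothetical.

```python
from itertools import combinations, product

def nonempty_subsets(base):
    """All nonempty subsets of base, as frozensets."""
    items = sorted(base)
    return [frozenset(c) for r in range(1, len(items) + 1)
            for c in combinations(items, r)]

def build_copies(universe, mu):
    """One copy (x, ran(f)) per choice function f with f(X) in X,
    X ranging over the sets where x is not minimal."""
    copies = []
    for x in sorted(universe):
        witnesses = [X for X in mu if x in X and x not in mu[X]]
        # product() over no witnesses yields one empty choice: a single copy
        for choice in product(*[sorted(X) for X in witnesses]):
            copies.append((x, frozenset(choice)))
    return copies

def mu_of_structure(copies, X):
    """Assumed relation: (x', f') is smaller than (x, f) iff x' in ran(f).
    Returns the first coordinates of the copies minimal in X."""
    present = {x for x, _ in copies if x in X}
    return frozenset(x for x, ran in copies
                     if x in X and not (ran & present))

universe = frozenset("abc")
mu = {X: X for X in nonempty_subsets(universe)}
mu[frozenset("abc")] = frozenset("ab")   # c is minimized only in {a, b, c}

copies = build_copies(universe, mu)
represented = all(mu_of_structure(copies, X) == mu[X] for X in mu)
copies_of_c = [ran for x, ran in copies if x == "c"]
```

In this example $c$ receives three copies, one per possible "killer" in $\{a, b, c\}$; the copy whose killer is $b$ stays minimal in $\{a, c\}$, and the one whose killer is $a$ stays minimal in $\{b, c\}$, so no single copy of $c$ would suffice.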

So far, we did not care about prerequisites, we just looked at the necessary construction. 

This will be done in a second step, where we will see that sufficiently strong prerequisites, especially about closure of the
domain of $\mu$, in particular whether for $X$, $X'$ in the domain $X \cup X'$ also is in the domain, can simplify the construction
considerably.

Thus, we have a core - the basic construction -, an initial part - eventual simplifications due to closure properties -, but are
not yet finished: so far, we looked only at model sets (or, more generally, just arbitrary sets) and the algebraic properties
of the $\mu$-functions to represent, and we still have to translate our results to logic. As long as our $\mu$-functions preserve
definability, i.e., $\mu(M(T))$, $M(T)$ the set of models of some theory $T$, is again $M(T')$ for some, usually other, theory $T'$,
this is simple. But, of course, this is a very strong assumption in the general infinite case. Usually, $\mu(M(T))$ will only
generate a theory, but it will not be all of the models of this theory - logic is too poor to see that something is missing. So
our observation of the result is blurred.

Thus, there is still a final part to consider. We have shown in [Sch04] that the general case is impossible to characterize
by traditional means, and we need other means - we have to work with "small" model sets: our observation can be a bit 
off the true situation, but not too much. This, again, can be seen as a problem of ignorance: we do not know the exact 
result, but only that it is not too far off from what we see. Thus, characterization will be exactly this: it gives a sort of 
upper limit of how far we can be off, and anything within those limits is possible, it does not give an exact result. 

This last part will be considered in a more systematic way, too. In particular, we will look at conditions which might be
affected, and at those which will not be affected. At the same time, we take care to make our basic construction sufficiently
general so we do not need that $\mu(X)$ is an element of the domain of $\mu$.

To summarize, we clearly distinguish here three parts: a core part with the essential construction, an initial part with 
possible simplifications when domain closure conditions are sufficient, and a final part concerning the sharpness of our 
observations. 

A problem which is conceptually similar to the definability preservation question is that of "hidden dimensions" . E.g. in 
update situations (but also in situations of a not necessarily symmetric distance based revision), we may see only the result, 
the last dimension, and initial and intermediate steps are hidden from direct observation. More precisely, we may know that 
the endpoints of preferred developments all are in some set X, but have no direct information where intermediate points 
are. Again, our tools of observation are not sufficiently sharp to see the precise picture. This may generate non-trivial 
problems, especially when we may have infinite descending chains. A natural solution is to consider here (partly or totally) 
the limit approach, where those things hold which hold in the limit - i.e. the further we "approach" the (nonexisting)
minimum, these properties will finally hold.
This is another subject of the present text:

We will introduce the concepts of the structural, the algebraic, and the logical limit, and will see that this allows us to 
separate problems in this usually quite difficult case. Some problems are simply due to the fact that a seemingly nice 
structural limit does not have nice algebraic properties any more, so it should not be considered. So, to have a "good" 
limit, the limit should not only capture the idea of a structural limit, but its algebraic counterpart should also capture the 
essential algebraic properties of the minimal choice functions. Other problems are due to the fact that the nice algebraic 
limit does not translate to nice logical properties, and we will see that this is often due to the same problems we saw in 
the absence of definability preservation. 

Thus, in a way, we come back in a cycle to the same problems again. This is one of the reasons the book form seems 
adequate for our results and problems: they are often interconnected, and a unified presentation seems the best. 

It might be useful to emphasize the parallel investigation in the minimal and the limit variant: 

For minimal preferential structures, we have: 

• logical laws or descriptions like $\alpha \mid\sim \alpha$ - they are the (imperfect - by definability preservation problems) reflection of
the abstract description,

• abstract or algebraic semantics, like $\mu(A) \subseteq A$ - they are the abstract description of the foundation,

• structural semantics - they are the intuitive foundation. 

Likewise, for the limit situation, we have: 

• structural limits - they are again the foundation, 

• resulting abstract behaviour, which, again, has to be an abstract or algebraic limit, resulting from the structural 
limit, 

• a logical limit, which reflects the abstract limit, and may be plagued by definability preservation problems etc. when 
going from the model to the logics side. 

Note that these clear distinctions have some philosophical importance, too. The structures need an intuitive or philosophical 
justification, why do we describe preference by transitive relations, why do we admit copies, etc.? The resulting algebraic 
choice functions are void of such questions. 



4.1.2 Basic definitions 

The following two definitions make preferential structures precise. We first give the algebraic definition, and then the
definition of the consequence relation generated by a preferential structure. In the algebraic definition, the set $U$ is an
arbitrary set, in the application to logic, this will be the set of classical models of the underlying propositional language.

In both cases, we first present the simpler variant without copies, and then the one with copies. (Note that e.g. [KLM90],
[LM92] use labelling functions instead, the version without copies corresponds to injective labelling functions, the one with
copies to the general case. These are just different ways of speaking.) We will discuss the difference between the version
without and the version with copies below, where we show that the version with copies is strictly more expressive than
the version without copies, and that transitivity of the relation adds new properties in the case without copies. When we
summarize our own results below (see Section 4.2.2.2), we will mention that, in the general case with copies,
transitivity can be added without changing properties.

We give here the "minimal version", the much more complicated "limit version" is presented and discussed in Section 5.5.
Recall the intuition that the relation $\prec$ expresses "normality" or "importance" - the $\prec$-smaller, the more
normal or important. The smallest elements are those which count.

Definition 4.1.1 

Fix $U \neq \emptyset$, and consider arbitrary $X$. Note that this $X$ has not necessarily anything to do with $U$, or $\mathcal{U}$ below. Thus, the
functions $\mu_{\mathcal{M}}$ below are in principle functions from $V$ to $V$ - where $V$ is the set theoretical universe we work in.

Note that we work here often with copies of elements (or models). In other areas of logic, most authors work with valuation
functions. Both definitions - copies or valuation functions - are equivalent, a copy $\langle x, i \rangle$ can be seen as a state $\langle x, i \rangle$ with
valuation $x$. In the beginning of research on preferential structures, the notion of copies was widely used, whereas e.g.
[KLM90] used that of valuation functions. There is perhaps a weak justification of the former terminology. In modal
logic, even if two states have the same valid classical formulas, they might still be distinguishable by their valid modal
formulas. But this depends on the fact that modality is in the object language. In most work on preferential structures, the
consequence relation is outside the object language, so different states with same valuation are in a stronger sense copies
of each other.

(1) Preferential models or structures.

(1.1) The version without copies:

A pair $\mathcal{M} := \langle U, \prec \rangle$ with $U$ an arbitrary set, and $\prec$ an arbitrary binary relation on $U$ is called a preferential
model or structure.

(1.2) The version with copies:

A pair $\mathcal{M} := \langle \mathcal{U}, \prec \rangle$ with $\mathcal{U}$ an arbitrary set of pairs, and $\prec$ an arbitrary binary relation on $\mathcal{U}$ is called a
preferential model or structure.

If $\langle x, i \rangle \in \mathcal{U}$, then $x$ is intended to be an element of $U$, and $i$ the index of the copy.

We sometimes also need copies of the relation $\prec$, we will then replace $\prec$ by one or several arrows $\alpha$ attacking
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(2) Minimal elements, the functions 

(2.1) The version without copies: 
Let Ai := (U, -<), and define 

Hm{X) := {x G X : x G U A ~^3x' eln U.x' -< x}. 

jtix(X) is called the set of minimal elements of X (in Ai). 

Thus, hm(X) is the set of elements such that there is no smaller one in X. 

(2.2) The version with copies: 

Let Ai :— (U, -<) be as above. Define 

Hm{X) := {x G X : 3{x, i) G U.^3{x' , i') G U{x' G X A (a;', i 1 )' < (x, i))}. 

Thus, hm{X) is the projection on the first coordinate of the set of elements such that there is no smaller one 
in X. 

Again, by abuse of language, we say that fiM(X) is the set of minimal elements of X in the structure. If the 
context is clear, we will also write just fi. 

We sometimes say that (x,i) "kills" or "minimizes" (y,j) if (x, i) -< (y,j)- By abuse of language we also say a 
set X kills or minimizes a set Y if for all (y,j) G U, y G Y there is (x, i) EU, x G X s.t. (x. i) -< (y,j). 

Ai is also called injective or 1-copy, iff there is always at most one copy (x, i) for each x. Note that the existence 
of copies corresponds to a non-injective labelling function - as is often used in nonclassical logic, e.g. modal 
logic. 

We say that Ai is transitive, irreflexive, etc., iff -< is. 
Note that fi(X) might well be empty, even if X is not. 
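
For finite structures, the two versions of the minimal-element function can be computed directly. The following is only a minimal sketch: the names `mu_plain` and `mu_copies`, and the encoding of the relation as a set of pairs `(smaller, bigger)`, are assumptions made for illustration and do not come from the text.

```python
def mu_plain(U, prec, X):
    """Version without copies: minimal elements of X in the structure <U, prec>.

    prec is a set of pairs (a, b), read as "a is smaller than b".
    """
    candidates = set(X) & set(U)
    return {x for x in candidates
            if not any((y, x) in prec for y in candidates)}


def mu_copies(copies, prec, X):
    """Version with copies: copies is a set of pairs (x, i).

    Returns every x in X that has at least one copy (x, i) with no
    smaller copy (y, j) whose first coordinate y lies in X.
    """
    X = set(X)
    return {x for (x, i) in copies
            if x in X and not any((c, (x, i)) in prec
                                  for c in copies if c[0] in X)}


# A non-transitive chain a < b < c (hypothetical toy data).
U = {"a", "b", "c"}
prec = {("a", "b"), ("b", "c")}
print(mu_plain(U, prec, {"a", "b", "c"}))   # only "a" is minimal
print(mu_plain(U, prec, {"a", "c"}))        # "a" and "c" are incomparable, both minimal

# A cycle makes the set of minimal elements empty although X is not.
print(mu_plain({"a", "b"}, {("a", "b"), ("b", "a")}, {"a", "b"}))
```

In the version with copies, an element survives as soon as one of its copies is not killed, which is why the function collects first coordinates rather than pairs.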

Definition 4.1.2 

We define the consequence relation of a preferential structure for a given propositional language $\mathcal{L}$.

(1) (1.1) If $m$ is a classical model of a language $\mathcal{L}$, we say by abuse of language

$\langle m,i \rangle \models \phi$ iff $m \models \phi$,

and if $X$ is a set of such pairs, that

$X \models \phi$ iff for all $\langle m,i \rangle \in X$ $m \models \phi$.

(1.2) If $\mathcal{M}$ is a preferential structure, and $X$ is a set of $\mathcal{L}$-models for a classical propositional language $\mathcal{L}$, or a set of
pairs $\langle m,i \rangle$, where the $m$ are such models, we call $\mathcal{M}$ a classical preferential structure or model.

(2) Validity in a preferential structure, or the semantical consequence relation defined by such a structure:
Let $\mathcal{M}$ be as above.

We define:

$T \models_{\mathcal{M}} \phi$ iff $\mu_{\mathcal{M}}(M(T)) \models \phi$, i.e. $\mu_{\mathcal{M}}(M(T)) \subseteq M(\phi)$.

$\mathcal{M}$ will be called definability preserving iff for all $X \in D_{\mathcal{L}}$, $\mu_{\mathcal{M}}(X) \in D_{\mathcal{L}}$.

As $\mu_{\mathcal{M}}$ is defined on $D_{\mathcal{L}}$, but need by no means always result in some new definable set, this is (and reveals itself as a
quite strong) additional property.

Example 4.1.1 

This simple example illustrates the importance of copies. Such examples seem to have appeared for the first time in print
in [KLM90], but can probably be attributed to folklore.

Consider the propositional language $\mathcal{L}$ of two propositional variables $p$, $q$, and the classical preferential model $\mathcal{M}$ defined
by

$m \models p \wedge q$, $m' \models p \wedge q$, $m_2 \models \neg p \wedge q$, $m_3 \models \neg p \wedge \neg q$, with $m_2 \prec m$, $m_3 \prec m'$, and let $\models_{\mathcal{M}}$ be its consequence relation. ($m$
and $m'$ are logically identical.)

Obviously, $Th(m) \vee \{\neg p\} \models_{\mathcal{M}} \neg p$, but there is no complete theory $T'$ s.t. $Th(m) \vee T' \models_{\mathcal{M}} \neg p$. (If there were one, $T'$
would correspond to $m$, $m_2$, $m_3$, or the missing $m_4 \models p \wedge \neg q$, but we need two models to kill all copies of $m$.) On the other
hand, if there were just one copy of $m$, then one other model, i.e. a complete theory would suffice. More formally, if we
admit at most one copy of each model in a structure $\mathcal{M}$, $m \not\models T$, and $Th(m) \vee T \models_{\mathcal{M}} \phi$ for some $\phi$ s.t. $m \models \neg \phi$ - i.e. $m$
is not minimal in the models of $Th(m) \vee T$ - then there is a complete $T'$ with $T' \vdash T$ and $Th(m) \vee T' \models_{\mathcal{M}} \phi$, i.e. there is
$m''$ with $m'' \models T'$ and $m'' \prec m$. □
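
The example can be checked mechanically. The sketch below is an illustration only: it encodes valuations as pairs of truth values for $(p, q)$, assumes that the model set of $Th(m) \vee T$ is the union of the two model sets, and uses the hypothetical helper names `mu_copies` and `entails_not_p`.

```python
def mu_copies(copies, prec, X):
    """Projection of the copies (x, i), x in X, that have no smaller copy from X."""
    X = set(X)
    return {x for (x, i) in copies
            if x in X and not any((c, (x, i)) in prec
                                  for c in copies if c[0] in X)}


def entails_not_p(copies, prec, X):
    """True iff every minimal model of X makes p false."""
    return all(not p for (p, q) in mu_copies(copies, prec, X))


# Valuations are pairs (p, q) of truth values.
PQ, NPQ, NPNQ, PNQ = (True, True), (False, True), (False, False), (True, False)

# m and m' are two copies of the same valuation; m2 and m3 have one copy each.
m, m_prime, m2, m3 = (PQ, 0), (PQ, 1), (NPQ, 0), (NPNQ, 0)
copies = {m, m_prime, m2, m3}
prec = {(m2, m), (m3, m_prime)}          # m2 < m and m3 < m'

# Th(m) v {not p}: the model set is {PQ} together with all models of not p.
print(entails_not_p(copies, prec, {PQ, NPQ, NPNQ}))      # True

# Th(m) v T' for each complete theory T': the model set is {PQ, v}.
print([entails_not_p(copies, prec, {PQ, v})
       for v in (PQ, NPQ, NPNQ, PNQ)])                   # [False, False, False, False]

# With a single copy of m, one smaller model is enough.
print(entails_not_p({m, m2}, {(m2, m)}, {PQ, NPQ}))      # True
```

Each single valuation kills at most one of the two copies of $m$, so $p \wedge q$ stays minimal; only both smaller models together remove it.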



We define now two additional properties of the relation, smoothness and rankedness. 
Definition 4.1.3 

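Let $\mathcal{Y} \subseteq \mathcal{P}(U)$. (In applications to logic, $\mathcal{Y}$ will be $D_{\mathcal{L}}$.)

A preferential structure $\mathcal{M}$ is called $\mathcal{Y}$-smooth iff:

(1) The version without copies:

If $x \in X \in \mathcal{Y}$, then either $x \in \mu(X)$ or there is $x' \in \mu(X)$ with $x' \prec x$.

(2) The version with copies:

If $x \in X \in \mathcal{Y}$, and $\langle x,i \rangle \in \mathcal{U}$, then either there is no $\langle x',i' \rangle \in \mathcal{U}$, $x' \in X$, $\langle x',i' \rangle \prec \langle x,i \rangle$, or there is $\langle x',i' \rangle \in \mathcal{U}$,
$\langle x',i' \rangle \prec \langle x,i \rangle$, $x' \in X$, s.t. there is no $\langle x'',i'' \rangle \in \mathcal{U}$, $x'' \in X$, with $\langle x'',i'' \rangle \prec \langle x',i' \rangle$.

When considering the models of a language $\mathcal{L}$, $\mathcal{M}$ will be called smooth iff it is $D_{\mathcal{L}}$-smooth; $D_{\mathcal{L}}$ is the default.
Obviously, the richer the set $\mathcal{Y}$ is, the stronger the condition $\mathcal{Y}$-smoothness will be.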
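
For a finite structure without copies, the smoothness condition can be tested by brute force. This is a sketch under assumptions: the family is given as a list of finite sets, the relation is a set of pairs `(smaller, bigger)`, and the names `minimal` and `is_smooth` are hypothetical.

```python
def minimal(prec, X):
    """Minimal elements of X under prec (pairs are (smaller, bigger))."""
    X = set(X)
    return {x for x in X if not any((y, x) in prec for y in X)}


def is_smooth(prec, family):
    """Smoothness (version without copies) relative to a finite family of sets."""
    for X in family:
        mins = minimal(prec, X)
        for x in set(X) - mins:
            # every non-minimal x needs a minimal element directly below it
            if not any((m, x) in prec for m in mins):
                return False
    return True


chain = {("a", "b"), ("b", "c")}          # not transitive
closed = chain | {("a", "c")}             # its transitive closure
family = [{"a", "b", "c"}, {"b", "c"}]

print(is_smooth(chain, family))           # False: no minimal element below "c" in {a, b, c}
print(is_smooth(closed, family))          # True
print(is_smooth(chain, [{"b", "c"}]))     # True: a poorer family gives a weaker condition
```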

Fact 4.1.1 

Let $\prec$ be an irreflexive, binary relation on $X$, then the following two conditions are equivalent:

(1) There is $\Omega$ and an irreflexive, total, binary relation $\prec'$ on $\Omega$ and a function $f : X \to \Omega$ s.t. $x \prec y \Leftrightarrow f(x) \prec' f(y)$ for
all $x, y \in X$.

(2) Let $x, y, z \in X$ and $x \bot y$ wrt. $\prec$ (i.e. neither $x \prec y$ nor $y \prec x$), then $z \prec x \Rightarrow z \prec y$ and $x \prec z \Rightarrow y \prec z$.



Proof 

(1) $\Rightarrow$ (2): Let $x \bot y$, thus neither $fx \prec' fy$ nor $fy \prec' fx$, but then $fx = fy$. Let now $z \prec x$, so $fz \prec' fx = fy$, so $z \prec y$.
$x \prec z \Rightarrow y \prec z$ is similar.

(2) $\Rightarrow$ (1): For $x \in X$ let $[x] := \{x' \in X : x \bot x'\}$, and $\Omega := \{[x] : x \in X\}$. For $[x], [y] \in \Omega$ let $[x] \prec' [y] :\Leftrightarrow x \prec y$. This
is well-defined: Let $x \bot x'$, $y \bot y'$ and $x \prec y$, then $x \prec y'$ and $x' \prec y'$. Obviously, $\prec'$ is an irreflexive, total binary relation.
Define $f : X \to \Omega$ by $fx := [x]$, then $x \prec y \Leftrightarrow [x] \prec' [y] \Leftrightarrow fx \prec' fy$. □



Definition 4.1.4 

We call an irreflexive, binary relation $\prec$ on $X$, which satisfies (1) (equivalently (2)) of Fact 4.1.1 (page 59), ranked. By
abuse of language, we also call a preferential structure $\langle X, \prec \rangle$ ranked, iff $\prec$ is.
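
Condition (2) of Fact 4.1.1 is easy to test on a finite set, and the classes $[x]$ from the proof give the layers. The following sketch assumes a relation encoded as a set of pairs `(smaller, bigger)`; the names `incomparable`, `is_ranked` and `layer_map` are introduced here for illustration only.

```python
def incomparable(prec, x, y):
    """x and y are incomparable: neither is below the other."""
    return (x, y) not in prec and (y, x) not in prec


def is_ranked(X, prec):
    """Irreflexivity plus condition (2) of the Fact above."""
    if any((x, x) in prec for x in X):
        return False
    for x in X:
        for y in X:
            if not incomparable(prec, x, y):
                continue
            for z in X:
                if (z, x) in prec and (z, y) not in prec:
                    return False
                if (x, z) in prec and (y, z) not in prec:
                    return False
    return True


def layer_map(X, prec):
    """The map f of condition (1): x is sent to its class [x]."""
    return {x: frozenset(y for y in X if incomparable(prec, x, y)) for x in X}


X = {"a", "b", "c"}
layered = {("a", "c"), ("b", "c")}           # a and b share the bottom layer
print(is_ranked(X, layered))                 # True
print(is_ranked(X, {("a", "c")}))            # False: b is incomparable to a, but not below c
f = layer_map(X, layered)
print(f["a"] == f["b"], f["a"] == f["c"])    # True False
```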

The first condition, smoothness, says that if $x \in X$ is not a minimal element of $X$, then there is $x' \in \mu(X)$ with $x' \prec x$. In the finite case
without copies, smoothness is a trivial consequence of transitivity and lack of cycles. But note that in the other cases 
infinite descending chains might still exist, even if the smoothness condition holds, they are just "short-circuited" : we might 
have such chains, but below every element in the chain is a minimal element. In the authors' opinion, smoothness is difficult 
to justify as a structural property (or, in a more philosophical spirit, as a property of the world): why should we always 
have such minimal elements below non-minimal ones? Smoothness has, however, a justification from its consequences. Its 
attractiveness comes from two sides: 

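First, it generates a very valuable logical property, cumulativity (CUM): If $\mathcal{M}$ is smooth, and $\overline{\overline{T}}$ is the set of $\models_{\mathcal{M}}$-consequences
of $T$, then $T \subseteq \overline{T'} \subseteq \overline{\overline{T}} \Rightarrow \overline{\overline{T}} = \overline{\overline{T'}}$.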

Second, for certain approaches, it facilitates completeness proofs, as we can look directly at "ideal" elements, without
having to bother about intermediate stages. See in particular the work by Lehmann and his co-authors, [KLM90], [LM92].

"Smoothness", or, as it is also called, "stopperedness" seems - in the authors' opinion - a misnomer. We think it should
better be called something like "weak transitivity": consider the case where $a \succ b \succ c$, but $c \not\prec a$, with $c \in \mu(X)$. It is then
not necessarily the case that $a \succ c$, but there is $c'$ "sufficiently close to $c$", i.e. in $\mu(X)$, s.t. $a \succ c'$. Results and proof
techniques underline this idea. First, in the general case with copies, and in the smooth case (in the presence of $(\cup)$!),
transitivity does not add new properties, it is "already present"; second, the construction of smoothness by sequences $\sigma$
(see below in Section 4.2.2.3 (page 70)) is very close in spirit to a transitive construction.

The second condition, rankedness, seems easier to justify already as a property of the structure. It says that, essentially, 
the elements are ordered in layers: If a and b are not comparable, then they are in the same layer. So, if c is above 
(below) a, it will also be above (below) b - like pancakes or geological strata. Apart from the triangle inequality (and 
leaving aside cardinality questions), this is then just a distance from some imaginary, ideal point. Again, this property has 
important consequences on the resulting model choice functions and consequence relations, making proof techniques for 
the non-ranked and the ranked case very different. 

$\mathcal{Y}$ can have certain properties. In classical propositional logic, for instance, if $\mathcal{Y}$ is the set of formula defined model sets,
then $\mathcal{Y}$ is closed under complements, finite unions and finite intersections. If $\mathcal{Y}$ is the set of theory defined model sets, $\mathcal{Y}$ is
closed under finite unions and arbitrary intersections, but no longer under complements.

The careful consideration of closure conditions of the domain was motivated by Lehmann's Plausibility Logic, see [Leh92a],
and re-motivated by the work of Arieli and Avron, see [AA00]. In both cases, the language does not have a built-in "or" -
resulting in the absence of $(\cup)$ for the domain.

When trying to show completeness of Lehmann's system, the second author noted the importance of the closure of the
domain under $(\cup)$, see [Sch96-3]. The work of Arieli and Avron incited him to look at this property in a more systematic
way, which led to the discovery of Example 4.2.4 (page 71), and thus of the enormous strength of closure of the domain
under finite unions, and, more generally, of the importance of domain closure conditions.

In the resulting completeness proofs again, a strategy of "divide and conquer" is useful. This helps us to unify (or extend)
our past completeness proofs for the smooth case in the following way: We will identify more clearly than in the past a
more or less simple algebraic property - $(HU)$, $(HU,u)$ etc. - which allows us to split the proofs into two parts. The first
[...] independent from closure properties, and is essentially an "administrative" way to use the property for a construction.
This split approach thus allows us to isolate the demonstration of the used property from the construction itself, bringing
both parts more clearly to light, and simplifying the proofs, by using common parts.

The reader will see that the successively more complicated conditions $(HU)$, $(HU,u)$ reflect well the successively more
complicated situations of representation:

$(HU)$ : smooth (and transitive) structures in the presence of $(\cup)$,
$(HU,u)$ : smooth structures in the absence of $(\cup)$.

This comparison becomes clearer when we see that in the final, most complicated case, we will have to carry around all
the history of minimization, $\langle Y_0, \ldots, Y_n \rangle$, necessary for transitivity, which could be summarized in the first case with finite
unions. Thus, from an abstract point of view, it is a very natural development.

In the rest of this Section 4.1 (page 55), we will only describe the problems to solve, without giving a solution. This will
be done in the next sections. Moreover, we will assume that we have precise knowledge of $f$, i.e. what we see as $f(X)$ for
$X \in \mathcal{Y}$ is really the result, and not some approximation - as we will permit later, in Section 5.4.

So this part is a leisurely description of problems and things to do. We start with the most general case, arbitrary 
preferential structures, turn to transitive such structures, then to smooth, then to smooth transitive ones, and conclude 
by ranked and ranked smooth structures. 

Throughout, we will try to preserve ignorance, i.e. not assume anything we are not forced to assume. This will become 
clearer in a moment. Once we have understood the problem, we will sometimes just gloss over it by choosing one solution, 
but we should always be conscious that there is a problem. 

We will consider here choice functions $f : \mathcal{Y} \to \mathcal{P}(W)$, where $\mathcal{Y} \subseteq \mathcal{P}(W)$, and the problems of representing them by various
preferential structures.

We will see the following basic representation problems and the constructions to solve them, i.e. to find representing
structures for

(1) General preferential structures

(2) General transitive preferential structures

(3) Smooth preferential structures

(4) Smooth transitive preferential structures

(5) Ranked preferential structures

(6) Smooth ranked preferential structures

The problems of and solutions to the ranked case are quite different from the first four cases. In particular, the situation
when $\emptyset \neq U \in \mathcal{Y}$, but $f(U) = \emptyset$ does not present major difficulties in cases (1) - (4), but is quite nasty in the last case.



4.1.2.1 The situation 

We work in some universe $W$, there is a function $f : \mathcal{Y} \to \mathcal{P}(W)$, where $\mathcal{Y} \subseteq \mathcal{P}(W)$, $f$ will have certain properties, and
perhaps $\mathcal{Y}$, too, and we will try to represent $f$ by a preferential structure $\mathcal{Z}$ of a certain type, i.e. we want $f = \mu_{\mathcal{Z}}$, with
$\mu_{\mathcal{Z}}$ the $\mu$-function or choice function of a preferential structure $\mathcal{Z}$. Note that the codomain of $f$ is not necessarily a subset
of $\mathcal{Y}$ - so we have to pay attention not to apply $f$ twice.

Before we go into details, we give now an overview of the results. 

The following table summarizes representation by preferential structures. The positive implications on the right are shown
in Proposition 2.3.4 (page 36) (going via the $\mu$-functions), those on the left are shown in the respective representation
theorems.

"singletons" means that the domain must contain all singletons, "1 copy" or "$\geq 1$ copy" means that the structure may
contain only 1 copy for each point, or several, "$(\mu\emptyset)$" etc. for the preferential structure mean that the $\mu$-function of the
structure has to satisfy this property.

We call a characterization "normal" iff it is a universally quantified boolean combination (of any fixed, but perhaps infinite, 
length) of rules of the usual form. We do not go into details here. 

In the second column from the left "$\Rightarrow$" means, for instance for the smooth case, that for any $\mathcal{Y}$ closed under finite unions,
and any choice function $f$ which satisfies the conditions in the left hand column, there is a (here $\mathcal{Y}$-smooth) preferential
structure $\mathcal{X}$ which represents it, i.e. for all $Y \in \mathcal{Y}$, $f(Y) = \mu_{\mathcal{X}}(Y)$, where $\mu_{\mathcal{X}}$ is the model choice function of the structure $\mathcal{X}$.

4.2 Preferential structures without domain conditions
4.2.1 General discussion 

We treat in this Section the general case without conditions on the domain. We will see that it is more difficult than when
we can impose the usual conditions (closure under finite intersections and finite unions). The latter case will be dealt with
briefly (as most of it was already done in [Sch04]) in Section 5.1 (page 83).

4.2.1.1 General preferential structures 

We give now just three simple facts to put the reader in the right mood for what follows. 
Fact 4.2.1

$(\mu\subseteq)$ and $(\mu PR)$ hold in all preferential structures.

Proof

Trivial. The central argument is: if $x, y \in X \subseteq Y$, and $x \prec y$ in $X$, then also $x \prec y$ in $Y$.
□



Fact 4.2.2

$(\mu\subseteq)$, $(\mu PR)$ and $(\mu CUM)$ hold in all smooth preferential structures.
Proof

By Fact 4.2.1 (page 61), we only have to show $(\mu CUM)$. By Fact |2"XT1 (page [521), $(\mu CUT)$ follows from $(\mu PR)$, so it
remains to show $(\mu CM)$. So suppose $\mu(X) \subseteq Y \subseteq X$, we have to show $\mu(Y) \subseteq \mu(X)$. Let $x \in X - \mu(X)$, so there is $x' \in X$,
$x' \prec x$, by smoothness, there must be $x'' \in \mu(X)$, $x'' \prec x$, so $x'' \in Y$, and $x \notin \mu(Y)$. The proof for the case with copies is
analogous.

Example 4.2.1 

This example was first given in [Sch92]. It shows that condition (PR) may fail in preferential structures which are not 
definability preserving. 

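Let $v(\mathcal{L}) := \{p_i : i \in \omega\}$, $n, n' \in M_{\mathcal{L}}$ be defined by $n \models \{p_i : i \in \omega\}$, $n' \models \{\neg p_0\} \cup \{p_i : 0 < i < \omega\}$.

Let $\mathcal{M} := \langle M_{\mathcal{L}}, \prec \rangle$ where only $n \prec n'$, i.e. just two models are comparable. Note that the structure is transitive and
smooth. Thus, by Fact 4.2.2, $(\mu\subseteq)$, $(\mu PR)$, $(\mu CUM)$ hold.

Let $\mu := \mu_{\mathcal{M}}$, and $\mid\sim$ be defined as usual by $\mu$.

Set $T := \emptyset$, $T' := \{p_i : 0 < i < \omega\}$. We have $M_T = M_{\mathcal{L}}$, $f(M_T) = M_{\mathcal{L}} - \{n'\}$, $M_{T'} = \{n, n'\}$, $f(M_{T'}) = \{n\}$. So by
the result of Example 2.2.1 (page 30), $f$ is not definability preserving, and, furthermore, $\overline{\overline{T}} = \overline{T}$, $\overline{\overline{T'}} = \overline{\{p_i : i < \omega\}}$, so
$p_0 \in \overline{\overline{T \cup T'}}$, but $\overline{\overline{\overline{T}} \cup T'} = \overline{\overline{T} \cup T'} = \overline{T'}$, so $p_0 \notin \overline{\overline{\overline{T}} \cup T'}$, contradicting $(PR)$, which holds in all definability preserving
preferential structures. □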



We know from Fact 4.2.1 (page 61) that $f$ has to satisfy $(\mu\subseteq)$ and $(\mu PR)$. Let then $u \in U \in \mathcal{Y}$.

If $u \in f(U)$, then, for this $u$, this $U$ there is nothing to do, we just have to take care that at least one copy of $u$ will be
minimal in $U$.

If $u \notin f(U)$, then $u$ must be minimized by some $u' \in U$ - more precisely: It might well be that in all smaller $U'$ $u$ is not
minimized: $U' \subsetneq U$, $U' \in \mathcal{Y}$, $u \in U'$ $\Rightarrow$ $u \in f(U')$. If e.g. $U = \{u, u', u''\}$, and $U' = \{u, u'\}$, $U'' = \{u, u''\}$ with $U', U'' \in \mathcal{Y}$,
then there cannot be $u \prec u$, nor $u' \prec u$, nor $u'' \prec u$, and we have to make copies of $u$, so that only in $U$, but neither in $U'$
nor in $U''$ all copies are minimized. Thus, what we have to do, is to create $\langle u,u \rangle$, $\langle u,u' \rangle$, $\langle u,u'' \rangle$, and to make $u \prec \langle u,u \rangle$,
$u' \prec \langle u,u' \rangle$, $u'' \prec \langle u,u'' \rangle$ (or something similar). Thus, in the presence of full $U$, all copies will be minimized, but in all
$U' \subsetneq U$ at least one copy of $u$ is not minimized. When we look now at our construction, we note the following: (a) $u$
is minimized in $U$, (b) we took no commitment for other $U' \subseteq U$. Thus, we might not know anything about such other
$U'$, and leave this question totally open - we preserved our ignorance, (c) the construction is independent of all other $U'$
- except that any $U'$ with $U \subseteq U'$ will also minimize $u$, but this was an inevitable fact. Note that there might also be
$U' \subseteq U$ with $u \in U' - f(U')$, but no minimal one (wrt. $\subseteq$).

We can see the problem of copies also as preservation of ignorance, and can also solve it with many structures - as is done
in [SGMRT00].

We have to do this construction now also for other $u$ and $U$, so we will perhaps introduce copies for other elements, too;
suppose we have copies $\langle u',y \rangle$ and $\langle u',y' \rangle$ for the above $u'$. As $\mu$ is insensitive to the particular index, and it wants only
at least one copy of $u'$ to be smaller than $\langle u,u' \rangle$, we have a problem we cannot decide: Shall we make $\langle u',y \rangle \prec \langle u,u' \rangle$,
or $\langle u',y' \rangle \prec \langle u,u' \rangle$, or both? Deciding for one solution would go beyond our knowledge (though it would do no harm,
representation would be preserved), and we would not preserve our ignorance. The only honest solution to the problem is
to admit that we do not know, and branch into all possible cases, i.e. for any nonempty subset $X'$ of the copies of $u'$, we
make all copies $\langle u',y \rangle \in X'$ smaller than $\langle u,u' \rangle$. Thus, we construct many structures instead of one, and say: the real one
is one of them, but we do not know.

Note that all these structures will be different, as points which are logically indiscernible will be different from an order
theoretic point of view. We should also note the parallel here to Kripke models for modal logic, where the standard
construction works with complete consistent theories in the full $\Box$-language, with nested $\Box$'s etc., where we might see the
differences between two points only when following arrows to some depths. Here, the situation is similar: $\langle u,u' \rangle \succ \langle u',y \rangle$
and $\langle u,u' \rangle \succ \langle u',y' \rangle$ are the same on level 0: it is in both cases $u$. On level 1, they are the same, too, as we see in both
cases $u'$, and only in level 2, they may begin to look different: $y$ and $y'$ may choose different successors.

Once we are aware of the problem - i.e. we do not know enough for a decision - we can, of course, choose one sufficient for 
our representation purpose. But it is important to see the arbitrariness of the decision we take. The natural solution will 
then be to decide for making ALL copies of $u'$ smaller than $\langle u,u' \rangle$.
[...] copy. Then, in the presence of all elements of $U$, or all elements of $U'$, all copies will be minimized. The solution is thus
to consider all copies $\langle u,g \rangle$, where $g \in \Pi\{X \in \mathcal{Y} : u \in X - f(X)\}$.

We will define $\langle u,g \rangle \succ \langle y,h \rangle$ iff $y \in ran(g)$ - this is the adequate condition - and forget about $h$, but keep in mind that we
took here an arbitrary decision, and should, to preserve ignorance, branch into all possibilities. We will see its importance
immediately, now.
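
For a finite domain, the construction with copies $\langle u,g \rangle$ can be carried out literally. The sketch below is an illustration under assumptions: the family and $f$ are given as frozensets, a choice function $g$ is stored as the tuple of its values (so that its range is the set of entries), and the names `build_copies` and `mu` are hypothetical. It is run on the three-set example discussed above, writing `u1`, `u2` for $u'$, $u''$.

```python
from itertools import product


def build_copies(family, f):
    """All copies (u, g), g a choice function on {X in family : u in X - f(X)}.

    g is stored as the tuple of its chosen values, so ran(g) = set(g).
    """
    universe = set().union(*family)
    copies = []
    for u in sorted(universe):
        bad_sets = [X for X in family if u in X and u not in f[X]]
        # product() over no sets yields exactly one empty choice function
        for g in product(*(sorted(X) for X in bad_sets)):
            copies.append((u, g))
    return copies


def mu(copies, X):
    """(y, h) is below (u, g) iff y is in ran(g); h plays no role."""
    return frozenset(u for (u, g) in copies
                     if u in X and not any(y in X for y in g))


U = frozenset({"u", "u1", "u2"})
U1 = frozenset({"u", "u1"})
U2 = frozenset({"u", "u2"})
f = {U: frozenset({"u1", "u2"}), U1: U1, U2: U2}

copies = build_copies([U, U1, U2], f)
print(copies)
print(all(mu(copies, X) == f[X] for X in f))   # True
```

The element `u` receives three copies, one for each possible value of $g(U)$, which matches the copies $\langle u,u \rangle$, $\langle u,u' \rangle$, $\langle u,u'' \rangle$ of the informal discussion.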



4.2.1.2 Transitive preferential structures 

Example 4.2.3 (page 68) shows that we cannot just make the above construction transitive and preserve representation.
This is an illustration of the fact that we have to be careful about excessive relations.

The new construction avoids this, as it "looks ahead":

Seen from one fixed point, an arbitrary relation is a graph where we identify $\succ$ with $\to$, i.e. $u \succ x$ will be written $u \to x$.
The picture is perhaps easier to read when we write this graph as a tree, repeating nodes when necessary. So, from the
starting point $u$, we can go to $x$ and $x'$, from $x$ to $y$ and $y'$, from $x'$ to $w$ and $w'$ and $w''$ etc. So we can write the tree of
all direct and indirect successors of $u$, $t(u)$, and if $x$ is a direct successor of $u$, then the tree for $x$, $t(x)$, will be a subtree of
$t(u)$, beginning at the successor $x$ of the root in $t(u)$.

This gives us now a method to control all direct and indirect successors of an element. We write the above tree as index, and
define $\langle u,t(u) \rangle \succ \langle x,t(x) \rangle$ iff $t(x)$ is the subtree of $t(u)$ which begins at the direct successor $x$ of $u$. In the next step, we
make the relation transitive. Of course, we now have to see that this can be done without destroying representation, and
we will use in our construction special choice functions, which always choose for $u \in Y - f(Y)$ $u$ itself - this is allowed, and
they will do what we want. The details are given in the formal construction below.



4.2.1.3 Smooth structures 

In analogy to Case (1), and with the same argument, we will consider choice functions $g \in \Pi\{f(X) : x \in X - f(X)\}$.

(In the final construction, we will construct simultaneously for all $u$, $U$ s.t. $u \in f(U)$ a $U$-minimal copy, so in the following
intuitive discussion, it will suffice to find minimal $u$, $x$, etc. with the required properties. This remark is destined for
readers who wonder how this will all fit together. We should also note that we will again be in the dilemma which copy to
make smaller, and will do so for all candidates - violating our principle of preserving ignorance. Yet, as before, as long as
we are aware of it, it will do no harm.)

To see the new problem arising now, we start with $U$, and suppose that $u \in f(U)$. Let now $u \in X - f(X)$, then we have
to find $x \in f(X)$ below $u$. First, $x$ must not be in $U$, as we would have destroyed minimality of $u$ in $U$, this is analogous
to Case (1), so we need $f(X) - U \neq \emptyset$. But let now $u \in f(Y)$, $x \in Y$. In Case (1), it was sufficient to find another copy
of $u$, which is minimal in $Y$. Now, we have to do more: to find a $y \in f(Y)$, $y$ below $u$, so smoothness will hold. We will
call the following process the "repairing process for $u$, $x$, and $Y$". Suppose then that $u \in f(Y)$, and $x \in f(Y)$ for $Y \in \mathcal{Y}$.
Then we have destroyed minimality of $u$ in $Y$, but have repaired smoothness immediately again by finding the minimal $x$.
The situation is different if $x \in Y - f(Y)$ (and there was no $x' \prec u$, $x' \in f(Y)$ chosen at the same time). Then we have
destroyed minimality of $u$ in $Y$, without repairing smoothness, and we have to repair it now by finding suitable $y \prec u$,
$y \in f(Y)$. Of course, $y$ must not be in $U$, as this would destroy minimality of $u$ in $U$.

Thus, we have to find for all $Y$ with $u \in f(Y)$, $x \in Y - f(Y)$ some $y \in f(Y)$, $y \prec u$, $y \notin U$. Note that this repair process
is individual, i.e. we do not have to find one universal $y$ which repairs lost minimality for ALL such $Y$ at the same time,
but it suffices to do it one by one, individually for every single such $Y$.

But now, the solutions $y$ for such $Y$ may have introduced new problems: Not only $x$ is below $u$, but also $y$ is below $u$. If
there is now $Z \in \mathcal{Y}$ s.t. $u \in f(Z)$, and $y \in Z - f(Z)$, then we have to do the same repairing process for $u$, $y$, $Z$: find
suitable $z \in f(Z)$ below $u$, $z \notin U$, etc. So we will have an infinite repairing process, where each step may introduce new
problems, which will be repaired in the next step.

To illustrate that the problem is still a bit more complicated, we make a definition, and see that we have to avoid in the above
situation not only $U$, but $H(U,u)$, to be defined now.

$H(U,u)_0 := U$,

$H(U,u)_{\alpha+1} := H(U,u)_\alpha \cup \bigcup \{X : u \in X \wedge \mu(X) \subseteq H(U,u)_\alpha\}$,
$H(U,u)_\lambda := \bigcup \{H(U,u)_\alpha : \alpha < \lambda\}$ for $limit(\lambda)$,
$H(U,u) := \bigcup \{H(U,u)_\alpha : \alpha < \kappa\}$ for $\kappa$ sufficiently big

($card(Z)$ suffices, as the procedure trivializes, when we cannot add any new elements).
$(HU,u)$ is the property:

$u \in \mu(U)$, $u \in Y - \mu(Y)$ $\Rightarrow$ $\mu(Y) \not\subseteq H(U,u)$ - of course for all $u$ and $U$.

($U, Y \in \mathcal{Y}$).
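
In the finite case the transfinite iteration collapses to a fixpoint loop. The following sketch computes $H(U,u)$ and checks $(HU,u)$ on a small smooth structure; the function names `minimal`, `hull` and `satisfies_HU`, the toy relation, and the choice to let $X$ range over the given family are assumptions made for illustration.

```python
from itertools import combinations


def minimal(prec, X):
    """Minimal elements of X under prec (pairs are (smaller, bigger))."""
    return frozenset(x for x in X if not any((y, x) in prec for y in X))


def hull(U, u, family, f):
    """H(U, u) for a finite family: repeat the successor step until nothing changes."""
    H = set(U)
    changed = True
    while changed:
        changed = False
        for X in family:
            if u in X and f[X] <= H and not X <= H:
                H |= X
                changed = True
    return frozenset(H)


def satisfies_HU(family, f):
    """(HU,u): u in f(U) and u in Y - f(Y) imply that f(Y) is not a subset of H(U, u)."""
    for U in family:
        for u in f[U]:
            H = hull(U, u, family, f)
            for Y in family:
                if u in Y and u not in f[Y] and f[Y] <= H:
                    return False
    return True


# Hypothetical smooth toy structure: x < u and y < w, nothing else.
W = ["u", "w", "x", "y"]
prec = {("x", "u"), ("y", "w")}
family = [frozenset(c) for r in range(1, 5) for c in combinations(W, r)]
f = {X: minimal(prec, X) for X in family}

print(sorted(hull(frozenset({"u", "y"}), "u", family, f)))   # ['u', 'w', 'y']
print(satisfies_HU(family, f))                               # True
```

Here $w$ enters the hull of $\{u, y\}$ because the minimal elements of $\{u, w, y\}$ already lie in $\{u, y\}$, while $x$ never does.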
Fact 4.2.3 

(HU, u) holds in smooth structures. 

The proof is given in Fact 4.2.22 (page 76) (2).

We note now that we have to consider our principle of preserving ignorance again: We can choose first arbitrary $y \in f(Y)$
to repair for $u$, $x$, $Y$. So which one we choose is - a priori - an arbitrary choice. Yet, this choice might have repercussions
[...] $z \in H(U,u)$, but no suitable $z'$, etc. So the, at first sight, arbitrary choice might reveal an impasse later on. We will see
that we can easily solve the problem in the not necessarily transitive case, but we do not see at the time of writing any
easy solution in the transitive case, if the domain is not necessarily closed under finite unions.

4.2.1.4 Transitive smooth structures 

The basic, now more complicated, situation to consider is the following: 

Let again $u \in f(U)$, $u \in X - f(X)$, we have to find $x \in f(X)$ - outside $H(U,u)$ as in Case (3). Thus, we need again
$f(X) - H(U,u) \neq \emptyset$. Again, we have to repair all damage done, i.e. for all $u$, $x$, $Y$ as discussed in Case (3), the infinite
repair process discussed there.

Suppose now that $x \in Y - f(Y)$, so we have to find $y \in f(Y)$, outside $H(U,u)$ by transitivity of the relation, as $y \prec x \prec u$,
and, in addition outside $H(X,x)$, as in Case (3), now for $X$ and $x$. Thus, we need $f(Y) - (H(U,u) \cup H(X,x)) \neq \emptyset$.
Moreover, we have to do the same for all elements $y$ introduced by the above repairing process. Again, we have to do
repairing: $y \prec u$, and $y \prec x$, so for all $Y'$ s.t. $u \in f(Y')$, $y \in Y' - f(Y')$ we have to repair for $u$, $y$, $Y'$, and if $x \in f(Y')$,
$y \in Y' - f(Y')$ we have to repair for $x$, $y$, $Y'$, creating new smaller elements, etc.

If $y \in Z - f(Z)$, we have to find $z \in f(Z)$, outside $H(U,u)$, $H(X,x)$, $H(Y,y)$, etc., so the further we go down, the longer
the condition will be. Thus, we need $f(Z) - (H(U,u) \cup H(X,x) \cup H(Y,y)) \neq \emptyset$. And, again we have to repair, for $u$, $z$,
$x$, $z$, and $y$, $z$.

And so on. 

Note again the arbitrariness of choice, when there is not a unique solution, i.e. no unique x, y, z etc. This has to be 
considered when we want to respect preservation of ignorance, but also an early wrong choice might lead to an impasse, 
leading to backtracking to this early wrong choice. 

We will see that the closure of the domain under (U) makes all this easily possible, but the authors do not see an easy 
solution in the absence of (U) at the time of writing - the problem is an initial potentially wrong choice, which we do not 
see how to avoid other than by trying. 

So we will give here only a formal negative result by an example, see Example 4.2.5 (page 75), and essentially repeat the
result given in [Sch04] using $(\cup)$, see Proposition 5.1.1 (page 83), presented in Section 5.1 (page 83).

4.2.1.5 Ranked structures 

We give here some definitions, and show elementary facts about ranked structures. We also prove a general abstract 
nonsense fact about extending relations, to be used here and also later on. 

The crucial fact will be Lemma 4.2.8 (page 65); it shows that we can do with either one or infinitely many copies of
each model. The reason behind it is the following: Suppose we have exactly two copies of one model, $m$, $m'$, where
$m$ and $m'$ have the same logical properties. If, e.g., $m \prec m'$, then, as we consider only minimal elements, $m'$ will be
"invisible". If $m$ and $m'$ are incomparable, then, by rankedness (modularity), they will have the same elements above (and
below) themselves: they have the same behavior in the preferential structure. An immediate consequence is the "singleton
property" of Lemma 4.2.8: One element suffices to destroy minimality, and it suffices to look at pairs (and
singletons).

We first note the following trivial 
Fact 4.2.4 

In a ranked structure, smoothness and the condition
$(\mu\emptyset)$ $X \neq \emptyset \Rightarrow \mu(X) \neq \emptyset$
are (almost) equivalent.

Proof

Suppose $(\mu\emptyset)$ holds, and let $x \in X - \mu(X)$, $x' \in \mu(X)$. Then $x' \prec x$ by rankedness. Conversely, if the structure is smooth
and there is an element $x \in X$ in the structure (recall that structures may have "gaps", but this condition is a minor point,
which we shall neglect here - this is the precise meaning of "almost"), then either $x \in \mu(X)$ or there is $x' \prec x$, $x' \in \mu(X)$,
so $\mu(X) \neq \emptyset$. □



Fact 4.2.5 

In the presence of $(\mu=)$ and $(\mu\subseteq)$, $f(Y) \cap (X - f(X)) \neq \emptyset$ is equivalent to $f(Y) \cap X \neq \emptyset$ and $f(Y) \cap f(X) = \emptyset$.
Proof

$f(Y) \cap (X - f(X)) = (f(Y) \cap X) - (f(Y) \cap f(X))$.

"$\Leftarrow$": Let $f(Y) \cap X \neq \emptyset$, $f(Y) \cap f(X) = \emptyset$, so $f(Y) \cap (X - f(X)) \neq \emptyset$.

"$\Rightarrow$": Suppose $f(Y) \cap (X - f(X)) \neq \emptyset$, so $f(Y) \cap X \neq \emptyset$. Suppose $f(Y) \cap f(X) \neq \emptyset$, so by $(\mu\subseteq)$ $f(Y) \cap X \cap Y \neq \emptyset$, so by
$(\mu=)$ $f(Y) \cap X \cap Y = f(X \cap Y)$, and $f(X) \cap X \cap Y \neq \emptyset$, so by $(\mu=)$ $f(X) \cap X \cap Y = f(X \cap Y)$, so $f(X) \cap Y = f(Y) \cap X$
and $f(Y) \cap (X - f(X)) = \emptyset$.

Fact 4.2.6

If $\prec$ on $X$ is ranked, and free of cycles, then $\prec$ is transitive.
Proof

Let $x \prec y \prec z$. If $x \bot z$, then $y \succ z$, resulting in a cycle of length 2. If $z \prec x$, then we have a cycle of length 3. So $x \prec z$. □
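
Fact 4.2.6 can be confirmed by exhaustive search on a small carrier set. The sketch below enumerates all irreflexive relations on four elements; the carrier, the pair encoding `(smaller, bigger)` and the helper names are assumptions for illustration, and the search only confirms the statement for this size, it does not replace the proof.

```python
from itertools import product


def is_ranked(X, prec):
    """Irreflexive, and incomparable elements have the same elements below and above."""
    if any((x, x) in prec for x in X):
        return False
    for x in X:
        for y in X:
            if (x, y) in prec or (y, x) in prec:
                continue
            for z in X:
                if (z, x) in prec and (z, y) not in prec:
                    return False
                if (x, z) in prec and (y, z) not in prec:
                    return False
    return True


def has_cycle(X, prec):
    """True iff the transitive closure of prec relates some x to itself."""
    closure = set(prec)
    for k, i, j in product(X, repeat=3):    # Warshall: k varies slowest
        if (i, k) in closure and (k, j) in closure:
            closure.add((i, j))
    return any((x, x) in closure for x in X)


def is_transitive(prec):
    return all((x, z) in prec
               for (x, y) in prec for (y2, z) in prec if y == y2)


X = range(4)
pairs = [(x, y) for x in X for y in X if x != y]
checked = counterexamples = 0
for bits in product([False, True], repeat=len(pairs)):
    prec = {p for p, b in zip(pairs, bits) if b}
    if is_ranked(X, prec) and not has_cycle(X, prec):
        checked += 1
        if not is_transitive(prec):
            counterexamples += 1
print(checked, counterexamples)
```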

The following Fact is essentially Fact 3.10.8 of [Sch04].
Fact 4.2.7

In all ranked structures, $(\mu\subseteq)$, $(\mu=)$, $(\mu PR)$, $(\mu=')$, $(\mu\|)$, $(\mu\cup)$, $(\mu\cup')$, $(\mu\in)$, $(\mu RatM)$ will hold, if the corresponding
closure conditions are satisfied.

Proof

$(\mu\subseteq)$ and $(\mu PR)$ hold in all preferential structures.
$(\mu=)$ and $(\mu=')$ are trivial.

$(\mu\cup)$ and $(\mu\cup')$: All minimal copies of elements in $f(Y)$ have the same rank. If some $y \in f(Y)$ has all its minimal copies
killed by an element $x \in X$, by rankedness, $x$ kills the rest, too.

$(\mu\in)$: If $f(\{a\}) = \emptyset$, we are done. Take the minimal copies of $a$ in $\{a\}$, they are all killed by one element in $X$.

$(\mu\|)$: Case $f(X) = \emptyset$: If below every copy of $y \in Y$ there is a copy of some $x \in X$, then $f(X \cup Y) = \emptyset$. Otherwise
$f(X \cup Y) = f(Y)$. Suppose now $f(X) \neq \emptyset$, $f(Y) \neq \emptyset$, then the minimal ranks decide: if they are equal, $f(X \cup Y) =
f(X) \cup f(Y)$, etc.

$(\mu RatM)$: Let $X \subseteq Y$, $y \in X \cap f(Y) \neq \emptyset$, $x \in f(X)$. By rankedness, $y \prec x$, or $y \bot x$; $y \prec x$ is impossible, as $y \in X$, so $y \bot x$,
and $x \in f(Y)$.

□ 



Definition 4.2.1 

Let $\mathcal{Z} = \langle \mathcal{X}, \prec \rangle$ be a preferential structure. Call $\mathcal{Z}$ $1-\infty$ over $Z$, iff for all $x \in Z$ there are exactly one or infinitely many
copies of $x$, i.e. for all $x \in Z$ $\{u \in \mathcal{X} : u = \langle x,i \rangle$ for some $i\}$ has cardinality 1 or $\geq \omega$.

The following Lemma is Lemma 3.10.4 of [Sch04].

Lemma 4.2.8

Let $\mathcal{Z} = \langle \mathcal{X}, \prec \rangle$ be a preferential structure and $f : \mathcal{Y} \to \mathcal{P}(Z)$ with $\mathcal{Y} \subseteq \mathcal{P}(Z)$ be represented by $\mathcal{Z}$, i.e. for $X \in \mathcal{Y}$
$f(X) = \mu_{\mathcal{Z}}(X)$, and $\mathcal{Z}$ be ranked and free of cycles. Then there is a structure $\mathcal{Z}'$, $1-\infty$ over $Z$, ranked and free of cycles,
which also represents $f$.

Proof 

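We construct $\mathcal{Z}' = \langle \mathcal{X}', \prec' \rangle$.

Let $A := \{x \in Z$: there is some $\langle x,i \rangle \in \mathcal{X}$, but for all $\langle x,i \rangle \in \mathcal{X}$ there is $\langle x,j \rangle \in \mathcal{X}$ with $\langle x,j \rangle \prec \langle x,i \rangle\}$,
let $B := \{x \in Z$: there is some $\langle x,i \rangle \in \mathcal{X}$, s.t. for no $\langle x,j \rangle \in \mathcal{X}$ $\langle x,j \rangle \prec \langle x,i \rangle\}$,
let $C := \{x \in Z$: there is no $\langle x,i \rangle \in \mathcal{X}\}$.

Let $c_i : i < \kappa$ be an enumeration of $C$. We introduce for each such $c_i$ $\omega$ many copies $\langle c_i,n \rangle : n < \omega$ into $\mathcal{X}'$, put all $\langle c_i,n \rangle$
above all elements in $\mathcal{X}$, and order the $\langle c_i,n \rangle$ by $\langle c_i,n \rangle \prec' \langle c_{i'},n' \rangle :\Leftrightarrow$ ($i = i'$ and $n > n'$) or $i > i'$. Thus, all $\langle c_i,n \rangle$ are
comparable.

If $a \in A$, then there are infinitely many copies of $a$ in $\mathcal{X}$, as $\mathcal{X}$ was cycle-free, we put them all into $\mathcal{X}'$. If $b \in B$, we
choose exactly one such minimal element $\langle b,m \rangle$ (i.e. there is no $\langle b,n \rangle \prec \langle b,m \rangle$) into $\mathcal{X}'$, and omit all other elements. (For
definiteness, assume in all applications $m = 0$.) For all elements from $A$ and $B$, we take the restriction of the order $\prec$ of
$\mathcal{X}$. This is the new structure $\mathcal{Z}'$.

Obviously, adding the $\langle c_i,n \rangle$ does not introduce cycles, irreflexivity and rankedness are preserved. Moreover, any
substructure of a cycle-free, irreflexive, ranked structure also has these properties, so $\mathcal{Z}'$ is $1-\infty$ over $Z$, ranked and free of
cycles.

We show that $\mathcal{Z}$ and $\mathcal{Z}'$ are equivalent. Let then $X \subseteq Z$, we have to prove $\mu(X) = \mu'(X)$ ($\mu := \mu_{\mathcal{Z}}$, $\mu' := \mu_{\mathcal{Z}'}$).

Let $z \in X - \mu(X)$. If $z \in C$ or $z \in A$, then $z \notin \mu'(X)$. If $z \in B$, let $\langle z,m \rangle$ be the chosen element. As $z \notin \mu(X)$, there is
$x \in X$ s.t. some $\langle x,j \rangle \prec \langle z,m \rangle$. $x$ cannot be in $C$. If $x \in A$, then also $\langle x,j \rangle \prec' \langle z,m \rangle$. If $x \in B$, then there is some $\langle x,k \rangle$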






Let z ∈ X − μ'(X). If z ∈ C or z ∈ A, then z ∉ μ(X). Let z ∈ B, and some (x,j) ≺' (z,m). x cannot be in C, as they were sorted on top, so (x,j) exists in 𝒳 too and (x,j) ≺ (z,m). But if any other (z,i) is also minimal in 𝒵 among the (z,k), then by rankedness also (x,j) ≺ (z,i), as (z,i)⊥(z,m), so z ∉ μ(X). □



Notation 4.2.1 

We fix the following notation: A := {x ∈ Z : f(x) = ∅} and B := Z − A (here and in future we sometimes write f(x) for f({x}), likewise f(x,x') = x for f({x,x'}) = {x}, etc., when the meaning is obvious).

Corollary 4.2.9

If f can be represented by a ranked 𝒵 free of cycles, then there is 𝒵', which is also ranked and cycle-free, all b ∈ B occur in 1 copy, all a ∈ A ∞ often. □



The following Example was presented in Fact 3.10.13 of [Sch04].
Example 4.2.2

This example shows that the conditions (μ⊆) + (μPR) + (μ=) + (μ∪) + (μ∈) can be satisfied, and still representation by a ranked structure is impossible.

Consider μ({a,b}) = ∅, μ({a}) = {a}, μ({b}) = {b}. The conditions (μ⊆) + (μPR) + (μ=) + (μ∪) + (μ∈) hold trivially. This is representable, e.g. by a_1 ≻ b_1 ≻ a_2 ≻ b_2 ... without transitivity. (Note that rankedness implies transitivity, a ≺ b ≺ c, but not for a = c.) But this cannot be represented by a ranked structure: As μ({a}) ≠ ∅, there must be a copy a_i of minimal rank, likewise for b and some b_i. If they have the same rank, μ({a,b}) = {a,b}, otherwise it will be {a} or {b}.

□ 



In the general situation we have possibly U ≠ ∅, but f(U) = ∅.

In this case, we only know that below each u ∈ U, there must be infinitely many u' ∈ U or infinitely many copies of such u' ∈ U. (It is only in such cases that we need copies for representation in ranked structures, see Lemma 4.2.8.) Thus, the amount of information we have is very small. It is not surprising that representation problems are now difficult, as we will see below (see Section 4.2.2.5), and we will not go into more details here.

4.2.2 Detailed discussion 

4.2.2.1 General preferential structures 

The material in this Section is taken from [Sch04], Section 3.2.1 there; the result was already shown in [Sch92] with the same methods.

Proposition 4.2.10 

Let μ : 𝒴 → 𝒫(U) satisfy (μ⊆) and (μPR). Then there is a preferential structure 𝒳 s.t. μ = μ_𝒳. See e.g. [Sch04].
Proof

The preferential structure is defined in Construction 4.2.1. Claim 4.2.12 shows representation. The construction is basic for much of the rest of the material on non-ranked structures.

Definition 4.2.2 

For x ∈ Z, let 𝒴_x := {Y ∈ 𝒴 : x ∈ Y − μ(Y)}, Π_x := Π𝒴_x.
Note that ∅ ∉ 𝒴_x, Π_x ≠ ∅, and that Π_x = {∅} iff 𝒴_x = ∅.



Claim 4.2.11 

Let μ : 𝒴 → 𝒫(Z) satisfy (μ⊆) and (μPR), and let U ∈ 𝒴. Then x ∈ μ(U) ⇔ x ∈ U ∧ ∃f ∈ Π_x. ran(f) ∩ U = ∅.
Proof

Case 1: 𝒴_x = ∅, thus Π_x = {∅}. "⇒": Take f := ∅. "⇐": x ∈ U ∈ 𝒴, 𝒴_x = ∅ ⇒ x ∈ μ(U) by definition of 𝒴_x.

Case 2: 𝒴_x ≠ ∅. "⇒": Let x ∈ μ(U) ⊆ U. It suffices to show Y ∈ 𝒴_x ⇒ Y − U ≠ ∅. But if Y ⊆ U and Y ∈ 𝒴_x, then x ∈ Y − μ(Y), contradicting (μPR). "⇐": If x ∈ U − μ(U), then U ∈ 𝒴_x, so ∀f ∈ Π_x. ran(f) ∩ U ≠ ∅. □



Construction 4.2.1 

Let 𝒳 := {(x,f) : x ∈ Z ∧ f ∈ Π_x}, and (x',f') ≺ (x,f) :⇔ x' ∈ ran(f). Let 𝒵 := (𝒳, ≺).



Claim 4.2.12 

For U ∈ 𝒴, μ(U) = μ_𝒵(U).



Proof 

By Claim 4.2.11, it suffices to show that for all U ∈ 𝒴 x ∈ μ_𝒵(U) ⇔ x ∈ U and ∃f ∈ Π_x. ran(f) ∩ U = ∅. So let U ∈ 𝒴. "⇒": If x ∈ μ_𝒵(U), then there is (x,f) minimal in 𝒳⌈U (recall that 𝒳⌈U := {(x,i) ∈ 𝒳 : x ∈ U}), so x ∈ U, and there is no (x',f') ≺ (x,f), x' ∈ U, so by Π_{x'} ≠ ∅ there is no x' ∈ ran(f), x' ∈ U, but then ran(f) ∩ U = ∅. "⇐": If x ∈ U, and there is f ∈ Π_x, ran(f) ∩ U = ∅, then (x,f) is minimal in 𝒳⌈U. □
(Claim 4.2.12 and Proposition 4.2.10)
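For a finite Z, the basic construction and its representation claim can be checked mechanically. The following Python sketch is an illustration only and not part of the original text: the names pi_x, build_structure and mu_structure are hypothetical, each choice function f is stored only through its range ran(f) (which is all the relation uses), and the sample μ is generated from a small non-transitive order, so that (μ⊆) and (μPR) hold.

```python
from itertools import combinations, product

def pi_x(x, domain, mu):
    """Ranges ran(f) of all f in Pi_x, the product of Y_x = {Y : x in Y - mu(Y)}."""
    y_x = [Y for Y in domain if x in Y and x not in mu[Y]]
    # If Y_x is empty, product() yields one empty tuple, i.e. Pi_x = {emptyset}.
    return {frozenset(choice) for choice in product(*y_x)}

def build_structure(universe, domain, mu):
    """Copies (x, ran f) for x in Z and f in Pi_x (assumption: f is kept only as ran f)."""
    return {(x, r) for x in universe for r in pi_x(x, domain, mu)}

def mu_structure(copies, U):
    """Minimal elements of U, where (x', f') is below (x, f) iff x' is in ran(f)."""
    in_u = [(x, r) for (x, r) in copies if x in U]
    return frozenset(x for (x, r) in in_u
                     if not any(y in r for (y, _) in in_u))

# Hypothetical sample data: a is below b, b is below c, without transitivity.
universe = ["a", "b", "c"]
below = {("a", "b"), ("b", "c")}
domain = [frozenset(s) for n in (1, 2, 3) for s in combinations(universe, n)]
mu = {Y: frozenset(x for x in Y if not any((y, x) in below for y in Y))
      for Y in domain}

copies = build_structure(universe, domain, mu)
represented = all(mu_structure(copies, U) == mu[U] for U in domain)
print(represented)
```

On this sample the constructed structure reproduces μ on every set of the domain, as the claim predicts. Collapsing functions with the same range into one copy is a simplification of the sketch; it does not change which elements are minimal.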



4.2.2.2 Transitive preferential structures 

The material in this Section is taken from [Sch04], Section 3.2.2 there; the result was already shown in [Sch92] with different methods.

We show here: 
Proposition 4.2.13 

Let μ : 𝒴 → 𝒫(U) satisfy (μ⊆) and (μPR). Then there is a transitive preferential structure 𝒳 s.t. μ = μ_𝒳. See e.g. [Sch04].

Proof 

4.2.2.2.1 Discussion: 

Construction 4.2.1 (also used in [Sch92]) cannot be made transitive as it is; this will be shown below in Example 4.2.3. The second construction in [Sch92] is a special one, which is transitive, but relies heavily on lack of smoothness. (For completeness' sake, we give a similar proof in Proposition 4.2.17.) We present here a more flexible and more adequate construction, which avoids a certain excess in the relation ≺ of the construction in Proposition 4.2.17: There, too many elements (y,g) are smaller than some (x,f), as the relation is independent of g. This excess prevents transitivity.

We refine now the construction of the relation, to have better control over successors. 

Recall that a tree of height ≤ ω seems the right way to encode the successors of an element, as far as transitivity is concerned (which speaks only about finite chains). Now, in the basic construction, different copies have different successors, chosen by different functions (elements of the cartesian product). As it suffices to make one copy of the successor smaller than the element to be minimized, we do the following: Let (x,g), with g ∈ Π{X : x ∈ X − f(X)} be one of the elements of the standard construction. Let (x',g') be s.t. x' ∈ ran(g), then we make again copies (x,g,g'), etc. for each such x' and g', and make only (x',g'), but not some other (x',g'') smaller than (x,g,g'), for some other g'' ∈ Π{X' : x' ∈ X' − f(X')}. Thus, [...] product. An element with its tree is a successor of another element with its tree, iff the former is an initial segment of the latter - see the definition in Construction 4.2.2.

Recall also that transitivity is for free as we can use the element itself to minimize it. This is made precise by the use of the trees tf_x for a given element x and choice function f_x. But they also serve another purpose. The trees tf_x are constructed as follows: The root is x, the first branching is done according to f_x, and then we continue with constant choice. Let, e.g. x' ∈ ran(f_x), we can now always choose x', as it will be a legal successor of x' itself, being present in all X' s.t. x' ∈ X' − f(X'). So we have a tree which branches once, directly above the root, and is then constant without branching. Obviously, this is essentially equivalent to the old construction in the not necessarily transitive case. This shows two things: first, the construction with trees gives the same μ as the old construction with simple choice functions. Second, even if we consider successors of successors, nothing changes: we are still with the old x'. Consequently, considering the transitive closure will not change matters, an element (x, tf_x) will be minimized by its direct successors iff it will be minimized by direct and indirect successors. If you like, the trees tf_x are the mathematical construction expressing the intuition that we know so little about minimization that we have to consider suicide a serious possibility - the intuitive reason why transitivity imposes no new conditions.

To summarize: Trees seem the right way to encode all the information needed for full control over successors for the transitive case. The special trees tf_x show that we have not changed things substantially, i.e. the new μ-functions in the simple case and for the transitive closure stay the same. We hope that this construction will show its usefulness in other contexts, its naturalness and generality seem to be a good promise.

We give below the example which shows that the old construction is too brutal for transitivity to hold. 

Recall that transitivity permits substitution in the following sense: If (the two copies of) x is killed by y_1 and y_2 together, and y_1 is killed by z_1 and z_2 together, then x should be killed by z_1, z_2, and y_2 together.

But the old construction substitutes too much: In the old construction, we considered elements (x,f), where f ∈ Π_x, with (y,g) ≺ (x,f) iff y ∈ ran(f), independent of g. This construction can, in general, not be made transitive, as Example 4.2.3 below shows.

The new construction avoids this, as it "looks ahead", and not all elements (y_1, t_{y_1}) are smaller than (x, t_x), where y_1 is a child of x in t_x (or y_1 ∈ ran(f)). The new construction is basically the same as Construction 4.2.1, but avoids making too many copies smaller than the copy to be killed.

Recall that we need no new properties of μ to achieve transitivity here, as a killed element x might (partially) "commit suicide", i.e. for some i, i' (x,i) ≺ (x,i'), so we cannot substitute x by any set which does not contain x: In this simple situation, if x ∈ X − μ(X), we cannot find out whether all copies of x are killed by some y ≠ x, y ∈ X. We can assume without loss of generality that there is an infinite descending chain of x-copies, which are not killed by other elements. Thus, we cannot replace any y_i as above by any set which does not contain y_i, but then substitution becomes trivial, as any set substituting y_i has to contain y_i. Thus, we need no new properties to achieve transitivity.



Example 4.2.3 

As we consider only one set in each case, we can index with elements, instead of with functions. So suppose x, y_1, y_2 ∈ X, y_1, z_1, z_2 ∈ Y, x ∉ μ(X), y_1 ∉ μ(Y), and that we need y_1 and y_2 to minimize x, so there are two copies (x,y_1), (x,y_2), likewise we need z_1 and z_2 to minimize y_1, thus we have (x,y_1) ≻ (y_1,z_1), (x,y_1) ≻ (y_1,z_2), (x,y_2) ≻ y_2, (y_1,z_1) ≻ z_1, (y_1,z_2) ≻ z_2 (the z_i and y_2 are not killed). If we take the transitive closure, we have (x,y_1) ≻ z_k for any k, so for any z_k {z_k, y_2} will minimize all of x, which is not intended. □
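The effect described in this example can be replayed on a small finite relation. The Python sketch below is illustrative only: the helper names transitive_closure and minimal_elements are hypothetical, copies are encoded as (element, index) pairs with index None for elements that have a single copy, and the test set W = {x, z_1, y_2} is chosen here to make the unintended minimization visible.

```python
def transitive_closure(pairs):
    """Smallest transitive relation containing the given set of pairs."""
    closure = set(pairs)
    changed = True
    while changed:
        changed = False
        for (a, b) in list(closure):
            for (c, d) in list(closure):
                if b == c and (a, d) not in closure:
                    closure.add((a, d))
                    changed = True
    return closure

def minimal_elements(copies, above, W):
    """Elements of W with at least one copy that has no smaller copy inside W.
    `above` holds pairs (bigger copy, smaller copy)."""
    in_w = [c for c in copies if c[0] in W]
    return {c[0] for c in in_w if not any((c, d) in above for d in in_w)}

# Copies and relation of the example (indices name the minimizing element).
copies = [("x", "y1"), ("x", "y2"), ("y1", "z1"), ("y1", "z2"),
          ("y2", None), ("z1", None), ("z2", None)]
above = {
    (("x", "y1"), ("y1", "z1")), (("x", "y1"), ("y1", "z2")),
    (("x", "y2"), ("y2", None)),
    (("y1", "z1"), ("z1", None)), (("y1", "z2"), ("z2", None)),
}

W = {"x", "z1", "y2"}
before = minimal_elements(copies, above, W)
after = minimal_elements(copies, transitive_closure(above), W)
print("x" in before, "x" in after)
```

Before taking the closure, the copy (x, y_1) survives in W because y_1 is absent; after the closure it lies above z_1, so x is minimized by {z_1, y_2} alone.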



The preferential structure is defined in Construction 4.2.2, Claim 4.2.15 shows representation for the simple structure, Claim 4.2.16 representation for the transitive closure of the structure.

The main idea is to use the trees tf_x, whose elements are exactly the elements of the range of the choice function f. This makes Construction 4.2.1 and Construction 4.2.2 basically equivalent, and shows that the transitive case is characterized by the same conditions as the general case. These trees are defined below in Fact 4.2.14, (3), and used in the proofs of Claim 4.2.15 and Claim 4.2.16.

Again, Construction 4.2.2 contains the basic idea for the treatment of the transitive case. It can certainly be re-used in other contexts.

Construction 4.2.2 

(1) For x ∈ Z, let T_x be the set of trees t_x s.t.

(a) all nodes are elements of Z,

(b) the root of t_x is x,

(c) height(t_x) ≤ ω,

(d) if y is an element in t_x, then there is f ∈ Π_y := Π{Y ∈ 𝒴 : y ∈ Y − μ(Y)} s.t. the set of children of y is ran(f).

(3) Let 𝒵 := ( {(x,t_x) : x ∈ Z, t_x ∈ T_x}, (x,t_x) ≻ (y,t_y) iff t_x ▷ t_y ).

Fact 4.2.14

(1) The construction ends at some y iff 𝒴_y = ∅, consequently T_x = {x} iff 𝒴_x = ∅. (We identify the tree of height 1 with its root.)

(2) If 𝒴_x ≠ ∅, tc_x, the totally ordered tree of height ω, branching with card = 1, and with all elements equal to x is an element of T_x. Thus, with (1), T_x ≠ ∅ for any x.

(3) If f ∈ Π_x, f ≠ ∅, then the tree tf_x with root x and otherwise composed of the subtrees t_y for y ∈ ran(f), where t_y := y iff 𝒴_y = ∅, and t_y := tc_y iff 𝒴_y ≠ ∅, is an element of T_x. (Level 0 of tf_x has x as element, the t_y's begin at level 1.)

(4) If y is an element in t_x and t_y the subtree of t_x starting at y, then t_y ∈ T_y.

(5) (x,t_x) ≻ (y,t_y) implies y ∈ ran(f) for some f ∈ Π_x. □



Claim 4.2.15 shows basic representation.
Claim 4.2.15

∀U ∈ 𝒴. μ(U) = μ_𝒵(U)



Proof 

By Claim 4.2.11, it suffices to show that for all U ∈ 𝒴 x ∈ μ_𝒵(U) ⇔ x ∈ U ∧ ∃f ∈ Π_x. ran(f) ∩ U = ∅. Fix U ∈ 𝒴. "⇒": x ∈ μ_𝒵(U) ⇒ ex. (x,t_x) minimal in 𝒵⌈U, thus x ∈ U and there is no (y,t_y) ∈ 𝒵, (y,t_y) ≺ (x,t_x), y ∈ U. Let f define the set of children of the root x in t_x. If ran(f) ∩ U ≠ ∅, if y ∈ U is a child of x in t_x, and if t_y is the subtree of t_x starting at y, then t_y ∈ T_y and (y,t_y) ≺ (x,t_x), contradicting minimality of (x,t_x) in 𝒵⌈U. So ran(f) ∩ U = ∅. "⇐": Let x ∈ U. If 𝒴_x = ∅, then the tree x has no ▷-successors, and (x,x) is ≻-minimal in 𝒵. If 𝒴_x ≠ ∅ and f ∈ Π_x s.t. ran(f) ∩ U = ∅, then (x,tf_x) is ≻-minimal in 𝒵⌈U. □



We consider now the transitive closure of 𝒵. (Recall that ≺* denotes the transitive closure of ≺.) Claim 4.2.16 shows that transitivity does not destroy what we have achieved. The trees tf_x will play a crucial role in the demonstration.

Claim 4.2.16

Let 𝒵' := ( {(x,t_x) : x ∈ Z, t_x ∈ T_x}, (x,t_x) ≻ (y,t_y) iff t_x ▷* t_y ).
Then μ_𝒵 = μ_𝒵'.



Proof 

Suppose there is U ∈ 𝒴, x ∈ U, x ∈ μ_𝒵(U), x ∉ μ_𝒵'(U). Then there must be an element (x,t_x) ∈ 𝒵 with no (x,t_x) ≻ (y,t_y) for any y ∈ U. Let f ∈ Π_x determine the set of children of x in t_x, then ran(f) ∩ U = ∅, consider tf_x. As all elements ≠ x of tf_x are already in ran(f), no element of tf_x is in U. Thus there is no (z,t_z) ≺* (x,tf_x) in 𝒵 with z ∈ U, so (x,tf_x) is minimal in 𝒵'⌈U, contradiction. □ (Claim 4.2.16 and Proposition 4.2.13)

Proposition 4.2.17

In the general case, every preferential structure is equivalent to a transitive one - i.e. they have the same μ-functions.
Proof

If (a,i) ≻ (b,j), we create an infinite descending chain of new copies (b,(j,a,i,n)), n ∈ ω, where (b,(j,a,i,n)) ≻ (b,(j,a,i,n')) if n' > n, and make (a,i) ≻ (b,(j,a,i,n)) for all n ∈ ω, but cancel the pair (a,i) ≻ (b,j) from the relation (otherwise, we would not have achieved anything), but (b,j) stays as element in the set. Now, the relation is trivially transitive, and all these (b,(j,a,i,n)) just kill themselves, there is no need to minimize them by anything else. We just continued (a,i) ≻ (b,j) in a way it cannot bother us. For the (b,j), we do of course the same thing again. So, we have full equivalence, i.e. the μ-functions of both structures are identical (this is trivial to see). □

4.2.2.3 Smooth structures 

4.2.2.3.1 Introduction 

4.2.2.3.2 Cumulativity without (∪)

We show here that, without sufficient closure properties, there is an infinity of versions of cumulativity, which collapse to usual cumulativity when the domain is closed under finite unions. Closure properties thus reveal themselves as a powerful tool to show independence of properties.

We then show positive results for the smooth and the transitive smooth case. 

We work in some fixed arbitrary set Z, all sets considered will be subsets of Z. 

Unless said otherwise, we use without further mentioning (μPR) and (μ⊆).

Note that (μPR) and (μ⊆) entail μ(A ∪ B) ⊆ μ(A) ∪ μ(B) whenever μ is defined for A, B, A ∪ B. (μ(A ∪ B) ∩ A ⊆ μ(A), μ(A ∪ B) ∩ B ⊆ μ(B), by (μPR), but μ(A ∪ B) ⊆ A ∪ B by (μ⊆).)

Definition 4.2.3 

For any ordinal α, we define
(μCumα):

If for all β ≤ α μ(X_β) ⊆ U ∪ ⋃{X_γ : γ < β} hold, then so does ⋂{X_γ : γ ≤ α} ∩ μ(U) ⊆ μ(X_α).
(μCumtα):

If for all β ≤ α μ(X_β) ⊆ U ∪ ⋃{X_γ : γ < β} hold, then so does X_α ∩ μ(U) ⊆ μ(X_α).
("t" stands for transitive, see Fact 4.2.18, (2.2) below.)

(μCum∞) and (μCumt∞) will be the class of all (μCumα) or (μCumtα) - read their "conjunction", i.e. if we say that (μCum∞) holds, we mean that all (μCumα) hold.

Note: 

The first conditions thus have the form:
(μCum0) μ(X_0) ⊆ U ⇒ X_0 ∩ μ(U) ⊆ μ(X_0),

(μCum1) μ(X_0) ⊆ U, μ(X_1) ⊆ U ∪ X_0 ⇒ X_0 ∩ X_1 ∩ μ(U) ⊆ μ(X_1),

(μCum2) μ(X_0) ⊆ U, μ(X_1) ⊆ U ∪ X_0, μ(X_2) ⊆ U ∪ X_0 ∪ X_1 ⇒ X_0 ∩ X_1 ∩ X_2 ∩ μ(U) ⊆ μ(X_2).

(μCumtα) differs from (μCumα) only in the consequence, the intersection contains only the last X_α - in particular, (μCum0) and (μCumt0) coincide.

Recall that condition (μCum1) is the crucial condition in [Leh92a], which failed, despite (μCUM), but which has to hold in all smooth models. This condition (μCum1) was the starting point of the investigation.

We briefly mention some major results on above conditions, taken from Fact 4.2.18 and shown there - we use the same numbering:

(1.1) (μCumα) ⇒ (μCumβ) for all β ≤ α

(1.2) (μCumtα) ⇒ (μCumtβ) for all β ≤ α

(2.1) All (μCumα) hold in smooth preferential structures

(2.2) All (μCumtα) hold in transitive smooth preferential structures

(3.1) (μCumβ) + (∪) ⇒ (μCumα) for all β ≤ α

(3.2) (μCumtβ) + (∪) ⇒ (μCumtα) for all β ≤ α

(5.2) (μCumα) ⇒ (μCUM) for all α

(5.3) (μCUM) + (∪) ⇒ (μCumα) for all α

Definition 4.2.4

(H(U,u)_α, H(U)_α, (HU,u), (HU).)
H(U,u)_0 := U,

H(U,u)_{α+1} := H(U,u)_α ∪ ⋃{X : u ∈ X ∧ μ(X) ⊆ H(U,u)_α},
H(U,u)_λ := ⋃{H(U,u)_α : α < λ} for limit(λ),

H(U,u) := ⋃{H(U,u)_α : α < κ} for κ sufficiently big (card(Z) suffices, as the procedure trivializes, when we cannot add any new elements).
(HU,u) is the property:

u ∈ μ(U), u ∈ Y − μ(Y) ⇒ μ(Y) ⊈ H(U,u) - of course for all u and U. (U, Y ∈ 𝒴).
Thus, (HU,u) entails μ(Y) ⊆ H(U,u), u ∈ μ(U) ∩ Y ⇒ u ∈ μ(Y).
For the case with (∪), we further define, independent of u,
H(U)_0 := U,

H(U)_{α+1} := H(U)_α ∪ ⋃{X : μ(X) ⊆ H(U)_α},
H(U)_λ := ⋃{H(U)_α : α < λ} for limit(λ),
H(U) := ⋃{H(U)_α : α < κ} again for κ sufficiently big
(HU) is the property:

u ∈ μ(U), u ∈ Y − μ(Y) ⇒ μ(Y) ⊈ H(U) - of course for all U. (U, Y ∈ 𝒴).

Thus, (HU) entails μ(Y) ⊆ H(U) ⇒ μ(U) ∩ Y ⊆ μ(Y).

Obviously, H(U,u) ⊆ H(U), so (HU) ⇒ (HU,u).
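For a finite domain, the inductive definitions of H(U) and H(U,u) reduce to a fixed-point iteration, since the procedure stops as soon as no new elements can be added. The Python sketch below is an illustration under that finiteness assumption; the function name hull and the three-set sample domain (with μ induced by the non-transitive order c ≺ b ≺ a) are hypothetical choices made here.

```python
def hull(U, domain, mu, u=None):
    """H(U) if u is None, otherwise H(U, u), for a finite domain of frozensets.
    Repeatedly adds every X (containing u, when u is given) with mu(X) inside
    the current hull, until nothing new is added."""
    h = set(U)
    while True:
        added = set()
        for X in domain:
            if (u is None or u in X) and mu[X] <= h:
                added |= X - h
        if not added:
            return frozenset(h)
        h |= added

# Hypothetical sample: order c below b, b below a, without transitivity.
U = frozenset("ac")
X0 = frozenset("bc")
X1 = frozenset("ab")
domain = [U, X0, X1]
mu = {U: frozenset("ac"), X0: frozenset("c"), X1: frozenset("b")}

h_all = hull(U, domain, mu)          # H(U)
h_a = hull(U, domain, mu, u="a")     # H(U, a)
print(sorted(h_all), sorted(h_a))
```

In this sample H(U,a) stays at {a, c} while H(U) grows to {a, b, c}, so the inclusion H(U,u) ⊆ H(U) can be strict when the domain is not closed under finite unions.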
Example 4.2.4 

This important example shows that the conditions (μCumα) and (μCumtα) defined in Definition 4.2.3 are all different in the absence of (∪), in its presence they all collapse (see Fact 4.2.18 below). More precisely, the following (class of) examples shows that the (μCumα) increase in strength. For any finite or infinite ordinal κ > 0 we construct an example s.t.

(a) (μPR) and (μ⊆) hold

(b) (μCUM) holds

(c) (⋂) holds

(d) (μCumtα) holds for α < κ

(e) (μCumκ) fails.

Proof 

We define a suitable base set and a non-transitive binary relation ≺ on this set, as well as a suitable set 𝒳 of subsets, closed under arbitrary intersections, but not under finite unions, and define μ on these subsets as usual in preferential structures by ≺. Thus, (μPR) and (μ⊆) will hold. It will be immediate that (μCumκ) fails, and we will show that (μCUM) and (μCumtα) for α < κ hold by examining the cases.

For simplicity, we first define a set of generators for 𝒳, and close under (⋂) afterwards. The set U will have a special position, it is the "useful" starting point to construct chains corresponding to above definitions of (μCumα) and (μCumtα).

In the sequel, i, j will be successor ordinals, λ etc. limit ordinals, α, β, κ any ordinals, thus e.g. λ ≤ κ will imply that λ is a limit ordinal ≤ κ, etc.

The base set and the relation ≺:

κ > 0 is fixed, but arbitrary. We go up to κ > 0.

The base set is {a, b, c} ∪ {d_λ : λ ≤ κ} ∪ {x_α : α ≤ κ+1} ∪ {x'_α : α ≤ κ}. a ≺ b ≺ c, x_α ≺ x_{α+1}, x_α ≺ x'_α, x'_0 ≺ x_λ (for any λ) - ≺ is NOT transitive.

The generators:

U := {a, c, x_0} ∪ {d_λ : λ ≤ κ} - i.e. {d_λ : lim(λ) ∧ λ ≤ κ},

X_i := {c, x_i, x'_i, x_{i+1}} (i < κ),

X_λ := {c, d_λ, x_λ, x'_λ, x_{λ+1}} ∪ {x'_α : α < λ} (λ < κ),

X'_κ := {a, b, c, x_κ, x'_κ, x_{κ+1}} if κ is a successor,

X'_κ := {a, b, c, d_κ, x_κ, x'_κ, x_{κ+1}} ∪ {x'_α : α < κ} if κ is a limit.

Thus, X'_κ = X_κ ∪ {a, b} if X_κ were defined.

Note that there is only one X'_κ, and X_α is defined only for α < κ, so we will not have X_α and X'_α at the same time.
Thus, the values of the generators under μ are:

μ(X_i) = {c, x_i},

μ(X_λ) = {c, d_λ} ∪ {x'_α : α < λ},

μ(X'_i) = {a, x_i} (i > 0, i has to be a successor),

μ(X'_λ) = {a, d_λ} ∪ {x'_α : α < λ}.

(We do not assume that the domain is closed under μ.)
Intersections:

We consider first pairwise intersections:

(1) U ∩ X_0 = {c, x_0},

(2) U ∩ X_i = {c}, i > 0,

(3) U ∩ X_λ = {c, d_λ},

(4) U ∩ X'_i = {a, c} (i > 0),

(5) U ∩ X'_λ = {a, c, d_λ},

(6) X_i ∩ X_j :

(6.1) j = i+1: {c, x_{i+1}},

(6.2) else {c},

(7) X_i ∩ X_λ :

(7.1) i < λ: {c, x'_i},

(7.2) i = λ+1: {c, x_{λ+1}},

(7.3) i > λ+1: {c},

(8) X_λ ∩ X_λ' : {c} ∪ {x'_α : α < min(λ, λ')}.

As X'_κ occurs only once, X_α ∩ X'_κ etc. give no new results.
Note that μ is constant on all these pairwise intersections.
Iterated intersections: 

As c is an element of all sets, sets of the type {c, z} do not give any new results. The possible subsets of {a, c, d_λ}: {c}, {a, c}, {c, d_λ} exist already. Thus, the only source of new sets via iterated intersections is X_λ ∩ X_λ' = {c} ∪ {x'_α : α < min(λ, λ')}. But, to intersect them, or with some old sets, will not generate any new sets either. Consequently, the example satisfies (⋂) for 𝒳 defined by U, X_i (i < κ), X_λ (λ < κ), X'_κ, and above pairwise intersections.

We will now verify the positive properties. This is tedious, but straightforward, we have to check the different cases. 
Validity of (μCUM):

Consider the prerequisite μ(X) ⊆ Y ⊆ X. If μ(X) = X or if X − μ(X) is a singleton, X cannot give a violation of (μCUM). So we are left with the following candidates for X:

(1) X_i := {c, x_i, x'_i, x_{i+1}}, μ(X_i) = {c, x_i}

Interesting candidates for Y will have 3 elements, but they will all contain a. (If κ < ω: U = {a, c, x_0}.)

(2) X_λ := {c, d_λ, x_λ, x'_λ, x_{λ+1}} ∪ {x'_α : α < λ}, μ(X_λ) = {c, d_λ} ∪ {x'_α : α < λ}

The only sets to contain d_λ are X_λ, U, U ∩ X_λ. But a ∈ U, and U ∩ X_λ is finite. (X_λ and X'_λ cannot be present at the same time.)

(3) X'_i := {a, b, c, x_i, x'_i, x_{i+1}}, μ(X'_i) = {a, x_i}

a is only in U, X'_i, U ∩ X'_i = {a, c}, but x_i ∉ U, as i > 0.

(4) X'_λ := {a, b, c, d_λ, x_λ, x'_λ, x_{λ+1}} ∪ {x'_α : α < λ}, μ(X'_λ) = {a, d_λ} ∪ {x'_α : α < λ}
d_λ is only in X'_λ and U, but U contains no x'_α.

Thus, (μCUM) holds trivially.
(μCumtα) hold for α < κ:

To simplify language, we say that we reach Y from X iff X ≠ Y and there is a sequence X_β, β ≤ α and μ(X_β) ⊆ X ∪ ⋃{X_γ : γ < β}, and X_α = Y, X_0 = X. Failure of (μCumtα) would then mean that there are X and Y, we can reach Y from X, and x ∈ (μ(X) ∩ Y) − μ(Y). Thus, in a counterexample, Y = μ(Y) is impossible, so none of the intersections can be such Y.

To reach Y from X, we have to get started from X, i.e. there must be Z s.t. μ(Z) ⊆ X, Z ⊈ X (so μ(Z) ≠ Z). Inspection of the different cases shows that we cannot reach any set Y from any case of the intersections, except from (1), (6.1), (7.2).

If Y contains a globally minimal element (i.e. there is no smaller element in any set), it can only be reached from any X which already contains this element. The globally minimal elements are a, x_0, and the d_λ, λ ≤ κ.

By these observations, we see that X_λ and X'_κ can only be reached from U. From no X_α U can be reached, as the globally minimal a is missing. But U cannot be reached from X'_κ either, as the globally minimal x_0 is missing.

When we look at the relation ≺ defining μ, we see that we can reach Y from X only by going upwards, adding bigger elements. Thus, from X_α, we cannot reach any X_β, β < α, the same holds for X'_κ and X_β, β < κ. Thus, from X'_κ, we

Consider now X_α. We can go up to any X_{α+n}, but not to any X_λ, α < λ, as d_λ is missing, neither to X'_κ, as a is missing. And we will be stopped by the first λ > α, as x_λ will be missing to go beyond X_λ. Analogous observations hold for the remaining intersections (1), (6.1), (7.2). But in all these sets we can reach, we will not destroy minimality of any element of X_α (or of the intersections).

Consequently, the only candidates for failure will all start with U. As the only element of U not globally minimal is c, such failure has to have c ∈ Y − μ(Y), so Y has to be X'_κ. Suppose we omit one of the X_α in the sequence going up to X'_κ. If κ ≥ λ > α, we cannot reach X_λ and beyond, as x'_α will be missing. But we cannot go to X_{α+n} either, as x_{α+1} is missing. So we will be stopped at X_α. Thus, to see failure, we need the full sequence U = X_0, X'_κ = Y_κ, Y_α = X_α for 0 < α < κ.

(μCumκ) fails:

The full sequence U = X_0, X'_κ = Y_κ, Y_α = X_α for 0 < α < κ shows this, as c ∈ μ(U) ∩ X'_κ, but c ∉ μ(X'_κ).

Consequently, the example satisfies (⋂), (μCUM), (μCumtα) for α < κ, and (μCumκ) fails.

□ 



Fact 4.2.18 

We summarize some properties of (μCumα) and (μCumtα) - sometimes with some redundancy. Unless said otherwise, α, β etc. will be arbitrary ordinals.

For (1) to (6) (μPR) and (μ⊆) are assumed to hold, for (7) only (μ⊆).

(1) Downward:

(1.1) (μCumα) ⇒ (μCumβ) for all β ≤ α

(1.2) (μCumtα) ⇒ (μCumtβ) for all β ≤ α

(2) Validity of (μCumα) and (μCumtα):

(2.1) All (μCumα) hold in smooth preferential structures

(2.2) All (μCumtα) hold in transitive smooth preferential structures

(2.3) (μCumtα) for 0 < α do not necessarily hold in smooth structures without transitivity, even in the presence of (⋂)

(3) Upward:

(3.1) (μCumβ) + (∪) ⇒ (μCumα) for all β ≤ α

(3.2) (μCumtβ) + (∪) ⇒ (μCumtα) for all β ≤ α

(3.3) {(μCumtβ) : β < α} + (μCUM) + (⋂) ⇏ (μCumα) for α > 0.

(4) Connection (μCumα) / (μCumtα):

(4.1) (μCumtα) ⇒ (μCumα)

(4.2) (μCumα) + (⋂) ⇏ (μCumtα)

(4.3) (μCumα) + (∪) ⇒ (μCumtα)

(5) (μCUM) and (μCumi):

(5.1) (μCUM) + (∪) entail:

(5.1.1) μ(A) ⊆ B ⇒ μ(A ∪ B) = μ(B)

(5.1.2) μ(X) ⊆ U, U ⊆ Y ⇒ μ(Y ∪ X) = μ(Y)

(5.1.3) μ(X) ⊆ U, U ⊆ Y ⇒ μ(Y) ∩ X ⊆ μ(U)

(5.2) (μCumα) ⇒ (μCUM) for all α

(5.3) (μCUM) + (∪) ⇒ (μCumα) for all α

(5.4) (μCUM) + (⋂) ⇒ (μCum0)

(6) (μCUM) and (μCumtα):

(6.1) (μCumtα) ⇒ (μCUM) for all α

(6.2) (μCUM) + (∪) ⇒ (μCumtα) for all α

(6.3) (μCUM) ⇏ (μCumtα) for all α > 0

(7) (μCum0) ⇒ (μPR)

Proof 

We prove these facts in a different order: (1), (2), (5.1), (5.2), (4.1), (6.1), (6.2), (5.3), (3.1), (3.2), (4.2), (4.3), (5.4), (3.3), 
(6.3), (7). 

(1.1)

For β < γ ≤ α set X_γ := X_β. Let the prerequisites of (μCumβ) hold. Then for γ with β < γ ≤ α μ(X_γ) ⊆ X_β by (μ⊆),

(1.2)

Analogous.
(2.1) 

Proof by induction. 

(μCum0) Let μ(X_0) ⊆ U, suppose there is x ∈ μ(U) ∩ (X_0 − μ(X_0)). By smoothness, there is y ≺ x, y ∈ μ(X_0) ⊆ U, contradiction. (The same argument works for copies: all copies of x must be minimized by some y ∈ μ(X_0), but at least one copy of x has to be minimal in U.)

Suppose (μCumβ) hold for all β < α. We show (μCumα). Let the prerequisites of (μCumα) hold, then those for (μCumβ), β < α hold, too. Suppose there is x ∈ μ(U) ∩ ⋂{X_γ : γ ≤ α} − μ(X_α). So by (μCumβ) for β < α x ∈ μ(X_β) for all β < α, moreover x ∈ μ(U). By smoothness, there is y ∈ μ(X_α) ⊆ U ∪ ⋃{X_β' : β' < α}, y ≺ x, but this is a contradiction. The same argument works again for copies.

(2.2) 

We use the following Fact: Let, in a smooth transitive structure, μ(X_β) ⊆ U ∪ ⋃{X_γ : γ < β} for all β ≤ α, and let x ∈ μ(U). Then there is no y ≺ x, y ∈ U ∪ ⋃{X_γ : γ ≤ α}.

Proof of the Fact by induction: Suppose such y ∈ U ∪ X_0 exists. y ∈ U is impossible. Let y ∈ X_0, by μ(X_0) ⊆ U, y ∈ X_0 − μ(X_0), so there is z ∈ μ(X_0), z ≺ y, so z ≺ x by transitivity, but μ(X_0) ⊆ U. Let the result hold for all β < α, but fail for α, so ¬∃y ≺ x. y ∈ U ∪ ⋃{X_γ : γ < α}, but ∃y ≺ x. y ∈ U ∪ ⋃{X_γ : γ ≤ α}, so y ∈ X_α. If y ∈ μ(X_α), then y ∈ U ∪ ⋃{X_γ : γ < α}, but this is impossible, so y ∈ X_α − μ(X_α), let by smoothness z ≺ y, z ∈ μ(X_α), so by transitivity z ≺ x, contradiction. The result is easily modified for the case with copies.

Let the prerequisites of (μCumtα) hold, then those of the Fact will hold, too. Let now x ∈ μ(U) ∩ (X_α − μ(X_α)), by smoothness, there must be y ≺ x, y ∈ μ(X_α) ⊆ U ∪ ⋃{X_γ : γ < α}, contradicting the Fact.

(2.3) 

Let α > 0, and consider the following structure over {a, b, c}: U := {a, c}, X_0 := {b, c}, X_α := ... := X_1 := {a, b}, and their intersections, {a}, {b}, {c}, with the order c ≺ b ≺ a (without transitivity). This is preferential, so (μPR) and (μ⊆) hold. The structure is smooth for U, all X_β, and their intersections. We have μ(X_0) ⊆ U, μ(X_β) ⊆ U ∪ X_0 for all β ≤ α, so μ(X_β) ⊆ U ∪ ⋃{X_γ : γ < β} for all β ≤ α but X_α ∩ μ(U) = {a} ⊈ {b} = μ(X_α) for α > 0.
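The counterexample for (2.3) is small enough to be verified directly. The following Python sketch is an illustration only: the function names mu, prerequisites_hold, cum and cumt are hypothetical, sequences are finite lists X_0, ..., X_n, and an instance whose prerequisites fail is counted as satisfying the condition.

```python
# Order of the example: c below b, b below a, without transitivity.
below = {("c", "b"), ("b", "a")}

def mu(X):
    """Minimal elements of X under the (non-transitive) relation `below`."""
    return frozenset(x for x in X if not any((y, x) in below for y in X))

def prerequisites_hold(U, seq):
    """mu(X_b) is inside U united with all earlier X_g, for every b."""
    reach = set(U)
    for X in seq:
        if not mu(X) <= reach:
            return False
        reach |= X
    return True

def cum(U, seq):
    """Instance of (muCum n), n = len(seq) - 1: intersection of all X_g."""
    if not prerequisites_hold(U, seq):
        return True
    common = frozenset.intersection(*seq)
    return common & mu(U) <= mu(seq[-1])

def cumt(U, seq):
    """Instance of (muCumt n): only the last X_n appears in the consequence."""
    if not prerequisites_hold(U, seq):
        return True
    return seq[-1] & mu(U) <= mu(seq[-1])

U = frozenset("ac")
X0 = frozenset("bc")
X1 = frozenset("ab")
print(cum(U, [X0, X1]), cumt(U, [X0, X1]))
```

For the sequence X_0, X_1 the prerequisites hold, the (μCum1) instance is satisfied because X_0 ∩ X_1 ∩ μ(U) is empty, and the (μCumt1) instance fails because X_1 ∩ μ(U) = {a} while μ(X_1) = {b}.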

(5.1) 

(5.1.1) μ(A) ⊆ B ⇒ μ(A ∪ B) ⊆ μ(A) ∪ μ(B) ⊆ B ⇒ (by (μCUM)) μ(B) = μ(A ∪ B).

(5.1.2) μ(X) ⊆ U ⊆ Y ⇒ (by (5.1.1)) μ(Y ∪ X) = μ(Y).

(5.1.3) μ(Y) ∩ X = (by (5.1.2)) μ(Y ∪ X) ∩ X ⊆ μ(Y ∪ X) ∩ (X ∪ U) ⊆ (by (μPR)) μ(X ∪ U) = (by (5.1.1)) μ(U).
(5.2)

Using (1.1), it suffices to show (μCum0) ⇒ (μCUM). Let μ(X) ⊆ U ⊆ X. By (μCum0) X ∩ μ(U) ⊆ μ(X), so by μ(U) ⊆ U ⊆ X ⇒ μ(U) ⊆ μ(X). U ⊆ X ⇒ μ(X) ∩ U ⊆ μ(U) by (μPR), but also μ(X) ⊆ U, so μ(X) ⊆ μ(U).

(4.1) 

Trivial. 

(6.1) 

Follows from (4.1) and (5.2). 
(6.2) 

Let the prerequisites of (μCumtα) hold.

We first show by induction μ(X_α ∪ U) ⊆ μ(U).

Proof:

α = 0: μ(X_0) ⊆ U ⇒ μ(X_0 ∪ U) = μ(U) by (5.1.1). Let for all β < α μ(X_β ∪ U) ⊆ μ(U) ⊆ U. By prerequisite, μ(X_α) ⊆ U ∪ ⋃{X_β : β < α}, thus μ(X_α ∪ U) ⊆ μ(X_α) ∪ μ(U) ⊆ ⋃{U ∪ X_β : β < α},

moreover for all β < α μ(X_β ∪ U) ⊆ U ⊆ X_α ∪ U, so μ(X_α ∪ U) ∩ (U ∪ X_β) ⊆ μ(U) by (5.1.3), thus μ(X_α ∪ U) ⊆ μ(U).

Consequently, under the above prerequisites, we have μ(X_α ∪ U) ⊆ μ(U) ⊆ U ⊆ U ∪ X_α, so by (μCUM) μ(U) = μ(X_α ∪ U), and, finally, μ(U) ∩ X_α = μ(X_α ∪ U) ∩ X_α ⊆ μ(X_α) by (μPR).

Note that finite unions take us over the limit step, essentially, as all steps collapse, and μ(X_α ∪ U) will always be μ(U), so there are no real changes.

(5.3) 

Follows from (6.2) and (4.1). 
(3.1) 

Follows from (5.2) and (5.3). 
(3.2) 

Follows from (6.1) and (6.2). 
(4.2)
(4.3)

Follows from (5.2) and (6.2). 
(5.4) 

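$\mu(X) \subseteq U \Rightarrow \mu(X) \subseteq U \cap X \subseteq X \Rightarrow \mu(X \cap U) = \mu(X) \Rightarrow X \cap \mu(U) = (X \cap U) \cap \mu(U) \subseteq \mu(X \cap U) = \mu(X)$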

(3.3) 

See Example 4.2.4 (page 71).
(6.3)

See Example 4.2.4 (page 71).
(7)

Trivial. Let $X \subseteq Y$, so by $(\mu \subseteq)$ $\mu(X) \subseteq X \subseteq Y$, so by $(\mu Cum0)$ $X \cap \mu(Y) \subseteq \mu(X)$.
□ 



Fact 4.2.19 

Assume $(\mu \subseteq)$.

We have for $(\mu Cum\infty)$ and $(HU,u)$:

(1) $x \in \mu(Y)$, $\mu(Y) \subseteq H(U,x) \Rightarrow Y \subseteq H(U,x)$

(2) $(\mu Cum\infty) \Rightarrow (HU,u)$

(3) $(HU,u) \Rightarrow (\mu Cum\infty)$

Proof 

(1) 

Trivial by definition of H (U, x). 
(2) 

Let $x \in \mu(U)$, $x \in Y$, $\mu(Y) \subseteq H(U,x)$ (and thus $Y \subseteq H(U,x)$ by definition). Thus, we have a sequence $X_0 := U$, $\mu(X_\beta) \subseteq U \cup \bigcup\{X_\gamma : \gamma < \beta\}$ and $x \in X_\beta$, and $Y = X_\alpha$ for some $\alpha$ (after $X_0$, enumerate arbitrarily $H(U,x)_1$, then $H(U,x)_2$, etc., do nothing at limits). So $x \in \bigcap\{X_\gamma : \gamma \leq \alpha\} \cap \mu(U) \subseteq \mu(X_\alpha) = \mu(Y)$ by $(\mu Cum\infty)$.

Remark: The same argument shows that we can replace "$x \in X$" equivalently by "$x \in \mu(X)$" in the definition of $H(U,x)_{\alpha+1}$, as was done in Definition 3.7.5 in [Sch04].

(3)

Suppose $(\mu Cum\alpha)$ fails; we show that then so does $(HU,u)$ for $u = x$. As $(\mu Cum\alpha)$ fails, for all $\beta \leq \alpha$ $\mu(X_\beta) \subseteq U \cup \bigcup\{X_\gamma : \gamma < \beta\}$, but there is $x \in \bigcap\{X_\gamma : \gamma \leq \alpha\} \cap \mu(U)$, $x \notin \mu(X_\alpha)$. Thus for all $\beta \leq \alpha$ $\mu(X_\beta) \subseteq X_\beta \subseteq H(U,x)$, moreover $x \in \mu(U)$, $x \in X_\alpha - \mu(X_\alpha)$, but $\mu(X_\alpha) \subseteq H(U,x)$, so $(HU,u)$ fails for $u = x$.
□ 



Fact 4.2.20 

We continue to show results for H(U) and H(U,u). 
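Let $A$, $X$, $U$, $U'$, $Y$ and all $A_i$ be in $\mathcal{Y}$.

(0) $H(U)$ and $H(U,u)$

(0.1) $H(U,u) \subseteq H(U)$,

(0.2) $(HU) \Rightarrow (HU,u)$,

(0.3) $(\cup) + (\mu PR)$ entail $H(U) \subseteq H(U,u)$ for $u \in \mu(U)$,

(0.4) $(\cup) + (\mu PR)$ entail $(HU,u) \Rightarrow (HU)$,

(1) $(\mu \subseteq)$ and $(HU)$ entail:

(1.1) $(\mu PR)$,

(1.2) $(\mu CUM)$,

(3) $(\mu \subseteq)$ and $(\mu PR)$ entail:

(3.1) $A = \bigcup\{A_i : i \in I\} \Rightarrow \mu(A) \subseteq \bigcup\{\mu(A_i) : i \in I\}$,

(3.2) $U \subseteq H(U)$, and $U \subseteq U' \Rightarrow H(U) \subseteq H(U')$,

(3.3) $\mu(U \cup Y) - H(U) \subseteq \mu(Y)$ - if $\mu(U \cup Y)$ is defined, in particular, if $(\cup)$ holds.

(4) $(\cup)$, $(\mu \subseteq)$, $(\mu PR)$, $(\mu CUM)$ entail:

(4.1) $H(U) = H_1(U)$,

(4.2) $U \subseteq A$, $\mu(A) \subseteq H(U) \Rightarrow \mu(A) \subseteq U$,

(4.3) $\mu(Y) \subseteq H(U) \Rightarrow Y \subseteq H(U)$ and $\mu(U \cup Y) = \mu(U)$,

(4.4) $x \in \mu(U)$, $x \in Y - \mu(Y) \Rightarrow Y \not\subseteq H(U)$ (and thus $(HU)$),

(4.5) $Y \not\subseteq H(U) \Rightarrow \mu(U \cup Y) \not\subseteq H(U)$.

(5) $(\cup)$, $(\mu \subseteq)$, $(HU)$ entail

(5.1) $H(U) = H_1(U)$,

(5.2) $U \subseteq A$, $\mu(A) \subseteq H(U) \Rightarrow \mu(A) \subseteq U$,

(5.3) $\mu(Y) \subseteq H(U) \Rightarrow Y \subseteq H(U)$ and $\mu(U \cup Y) = \mu(U)$,

(5.4) $x \in \mu(U)$, $x \in Y - \mu(Y) \Rightarrow Y \not\subseteq H(U)$,

(5.5) $Y \not\subseteq H(U) \Rightarrow \mu(U \cup Y) \not\subseteq H(U)$.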

Fact 4.2.21 

(0.1) and (0.2) trivial by definition. 

(0.3) Proof by induction. $H(U)_0 = H(U,u)_0$ is trivial. Suppose $H(U)_\beta = H(U,u)_\beta$ has been shown for $\beta < \alpha$. The limit step is trivial, so suppose $\alpha = \beta + 1$. Let $X$ be such that $\mu(X) \subseteq H(U)_\beta = H(U,u)_\beta$, so $X \subseteq H(U)_\alpha$. Consider $X \cup U$, so $u \in X \cup U$, $\mu(X \cup U)$ is defined and by $(\mu PR)$ and $(\mu \subseteq)$ $\mu(X \cup U) \subseteq \mu(X) \cup \mu(U) \subseteq H(U)_\beta = H(U,u)_\beta$, so $X \cup U \subseteq H(U,u)_\alpha$.

(0.4) Immediate by (0.3).

(1.1) By $(HU)$, if $\mu(Y) \subseteq H(U)$, then $\mu(U) \cap Y \subseteq \mu(Y)$. But, if $Y \subseteq U$, then $\mu(Y) \subseteq H(U)$ by $(\mu \subseteq)$.

(1.2) Let $\mu(U) \subseteq X \subseteq U$. Then by (1.1) $\mu(U) = \mu(U) \cap X \subseteq \mu(X)$. By $\mu(U) \subseteq X$ and $(\mu \subseteq)$, $\mu(U) \subseteq U \subseteq H(X)$, so by $(HU)$, $X \subseteq U$, and $(\mu \subseteq)$, $\mu(X) = \mu(X) \cap U \subseteq \mu(U)$.

(3.1) $\mu(A) \cap A_j \subseteq \mu(A_j) \subseteq \bigcup \mu(A_i)$, so by $\mu(A) \subseteq A = \bigcup A_i$, $\mu(A) \subseteq \bigcup \mu(A_i)$.

(3.2) trivial.

(3.3) $\mu(U \cup Y) - H(U) \subseteq$ (by (3.2)) $\mu(U \cup Y) - U \subseteq$ (by $(\mu \subseteq)$) $\mu(U \cup Y) \cap Y \subseteq$ (by $(\mu PR)$) $\mu(Y)$.

(4.1) We show that, if $X \subseteq H_2(U)$, then $X \subseteq H_1(U)$, more precisely, if $\mu(X) \subseteq H_1(U)$, then already $X \subseteq H_1(U)$, so the construction stops already at $H_1(U)$. Suppose then $\mu(X) \subseteq \bigcup\{Y : \mu(Y) \subseteq U\}$, and let $A := X \cup U$. We show that $\mu(A) \subseteq U$, so $X \subseteq A \subseteq H_1(U)$. Let $a \in \mu(A)$. By $(\mu PR)$, $(\mu \subseteq)$, $\mu(A) \subseteq \mu(X) \cup \mu(U)$. If $a \in \mu(U) \subseteq U$, we are done. If $a \in \mu(X)$, there is $Y$ s.t. $\mu(Y) \subseteq U$ and $a \in Y$, so $a \in \mu(A) \cap Y$. By Fact 4.2.18 (page 73), (5.1.3), we have for $Y$ s.t. $\mu(Y) \subseteq U$ and $U \subseteq A$ that $\mu(A) \cap Y \subseteq \mu(U)$. Thus $a \in \mu(U)$, and we are done again.

(4.2) Let $U \subseteq A$, $\mu(A) \subseteq H(U) = H_1(U)$ by (4.1). So $\mu(A) = \bigcup\{\mu(A) \cap Y : \mu(Y) \subseteq U\} \subseteq \mu(U) \subseteq U$, again by Fact 4.2.18 (page 73), (5.1.3).

(4.3) Let $\mu(Y) \subseteq H(U)$, then by $\mu(U) \subseteq H(U)$ and $(\mu PR)$, $(\mu \subseteq)$, $\mu(U \cup Y) \subseteq \mu(U) \cup \mu(Y) \subseteq H(U)$, so by (4.2) $\mu(U \cup Y) \subseteq U$ and $U \cup Y \subseteq H(U)$. Moreover, $\mu(U \cup Y) \subseteq U \subseteq U \cup Y \Rightarrow$ (by $(\mu CUM)$) $\mu(U \cup Y) = \mu(U)$.

(4.4) If not, $Y \subseteq H(U)$, so $\mu(Y) \subseteq H(U)$, so $\mu(U \cup Y) = \mu(U)$ by (4.3), but $x \in Y - \mu(Y) \Rightarrow$ (by $(\mu PR)$) $x \notin \mu(U \cup Y) = \mu(U)$, contradiction.

(4.5) $\mu(U \cup Y) \subseteq H(U) \Rightarrow$ (by (4.3)) $U \cup Y \subseteq H(U)$.

(5) Trivial by (1) and (4). 

□ 
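For a finite domain, the hulls $H(U)$ and $H(U,u)$ can be computed by iterating to a fixed point, which makes items such as (0.1), (0.3) and (3.2) easy to try out on examples. The sketch below assumes the inductive definition that the proof of (0.3) uses ($X$ is added at the successor step when $\mu(X)$ already lies in the current stage, and, for $H(U,u)$, when additionally $u \in X$). The function name `hull` and the ranked test structure are illustrative assumptions.

```python
from itertools import combinations

def hull(mu, U, u=None):
    """Iterate H_0 = U, H_{n+1} = H_n ∪ ⋃{X : mu(X) ⊆ H_n} up to a fixed point.

    If u is given, only sets X containing u are added, which yields H(U, u).
    """
    H = set(U)
    while True:
        new = set(H)
        for X, mX in mu.items():
            if mX <= H and (u is None or u in X):
                new |= X
        if new == H:
            return frozenset(H)
        H = new

# Illustrative mu: the chain c < b < a on all non-empty subsets of {a, b, c},
# so mu(Y) is the set of elements of lowest rank in Y.
rank = {"c": 0, "b": 1, "a": 2}
mu = {}
for n in (1, 2, 3):
    for Y in map(frozenset, combinations("abc", n)):
        low = min(rank[y] for y in Y)
        mu[Y] = frozenset(y for y in Y if rank[y] == low)
```

The domain of this example is closed under finite unions, so by (0.1) and (0.3) the two hulls should agree whenever $u \in \mu(U)$.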



We turn now to the representation result and its proof. 

We adapt Proposition 3.7.15 in [Sch04] and its proof. All we need is $(HU,u)$ and $(\mu \subseteq)$. We modify the proof of Remark 3.7.13 (1) in [Sch04] (now Remark 4.2.23 (page 77)) so we will not need $(\cap)$ any more. We will give the full proof, although its essential elements have already been published, for three reasons: First, the new version will need fewer prerequisites than the old proof does (closure under finite intersections is not needed any more, and replaced by $(HU,u)$). Second, we will more clearly separate the requirements to do the construction from the construction itself, thus splitting the proof neatly into two parts.

We show how to work with $(\mu \subseteq)$ and $(HU,u)$ only. Thus, once we have shown $(\mu \subseteq)$ and $(HU,u)$, we have finished the substantial side, and enter the administrative part, which will not use any prerequisites about domain closure any more. At the same time, this gives a uniform proof of the difficult part for the case with and without $(\cup)$; in the former case we can even work with the stronger $H(U)$. The easy direction of the former parts needs a proof of the stronger $H(U)$, but this is easy.

Note that, in the presence of $(\mu \subseteq)$, $(HU,u) \Rightarrow (\mu Cum\infty)$ and $(\mu Cum0) \Rightarrow (\mu PR)$, by Fact 4.2.19 (page 75), (3) and Fact 4.2.18 (page 73), (7), so $(HU,u)$ entails $(\mu PR)$, so we can use it in our context, where $(HU,u)$ will be the central property.



Fact 4.2.22
Proof

(1) Trivial by definition.

(2) Suppose not. So let $x \in \mu(U)$, $x \in Y - \mu(Y)$, $\mu(Y) \subseteq H(U,x)$. By smoothness, there is $x_1 \in \mu(Y)$, $x \succ x_1$, and let $\kappa_1$ be the least $\kappa$ s.t. $x_1 \in H(U,x)_{\kappa_1}$. $\kappa_1$ is not a limit, and $x_1 \in U'_{x_1} - \mu(U'_{x_1})$ with $x \in U'_{x_1}$ by definition of $H(U,x)$ for some $U'_{x_1}$, so as $x_1 \notin \mu(U'_{x_1})$, there must be (by smoothness) some other $x_2 \in \mu(U'_{x_1}) \subseteq H(U,x)_{\kappa_1 - 1}$ with $x \succ x_2$. Continue with $x_2$; we thus construct a descending chain of ordinals, which cannot be infinite, so there must be $x_n \in \mu(U'_{x_n}) \subseteq U$, $x \succ x_n$, contradicting minimality of $x$ in $U$. (More precisely, this works for all copies of $x$.) □

We first show two basic facts and then turn to the main result, Proposition 4.2.25 (page 77).
Definition 4.2.5

For $x \in Z$, let $W_x := \{\mu(Y) : Y \in \mathcal{Y} \wedge x \in Y - \mu(Y)\}$, $\Gamma_x := \Pi W_x$, and $K := \{x \in Z : \exists X \in \mathcal{Y}.\, x \in \mu(X)\}$.

Remark 4.2.23 

(1) $x \in K \Rightarrow \Gamma_x \neq \emptyset$,

(2) $g \in \Gamma_x \Rightarrow ran(g) \subseteq K$.

Proof

(1) We give two proofs, the first uses $(\mu Cum0)$, the second the stronger (by Fact 4.2.19 (page 75) (3)) $(HU,u)$.

(a) We have to show that $Y \in \mathcal{Y}$, $x \in Y - \mu(Y) \Rightarrow \mu(Y) \neq \emptyset$. Suppose then $x \in \mu(X)$ (this exists, as $x \in K$) and $\mu(Y) = \emptyset$, so $\mu(Y) \subseteq X$, $x \in Y$, so by $(\mu Cum0)$ $x \in \mu(Y)$.

(b) Consider $H(X,x)$, suppose $\mu(Y) = \emptyset$, $x \in Y$, so $Y \subseteq H(X,x)$, so $x \in \mu(Y)$ by $(HU,u)$.

(2) By definition, $\mu(Y) \subseteq K$ for all $Y \in \mathcal{Y}$. □



Claim 4.2.24 

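Let $U \in \mathcal{Y}$, $x \in K$. Then

(1) $x \in \mu(U) \Leftrightarrow x \in U \wedge \exists f \in \Gamma_x.\, ran(f) \cap U = \emptyset$,

(2) $x \in \mu(U) \Leftrightarrow x \in U \wedge \exists f \in \Gamma_x.\, ran(f) \cap H(U,x) = \emptyset$.

Proof

(1)

Case 1: $W_x = \emptyset$, thus $\Gamma_x = \{\emptyset\}$.
"$\Rightarrow$": Take $f := \emptyset$.

"$\Leftarrow$": $x \in U \in \mathcal{Y}$, $W_x = \emptyset \Rightarrow x \in \mu(U)$ by definition of $W_x$.
Case 2: $W_x \neq \emptyset$.

"$\Rightarrow$": Let $x \in \mu(U) \subseteq U$. Consider $H(U,x)$. If $\mu(Y) \in W_x$, then $x \in Y - \mu(Y)$, so by $(HU,u)$ for $H(U,x)$, $\mu(Y) - H(U,x) \neq \emptyset$, but $\mu(U) \subseteq U \subseteq H(U,x)$.

"$\Leftarrow$": If $x \in U - \mu(U)$, then $\mu(U) \in W_x$, moreover $\Gamma_x \neq \emptyset$ by Remark 4.2.23 (page 77), (1) and thus $\mu(U) \neq \emptyset$, so

$\forall f \in \Gamma_x.\, ran(f) \cap U \neq \emptyset$.

(2): The Case 1 is as for (1).

Case 2: "$\Rightarrow$" was shown already in Case 1.

"$\Leftarrow$": Let $x \in U - \mu(U)$, then by $x \in K$, $\mu(U) \neq \emptyset$ (see proof of Remark 4.2.23 (page 77)), moreover $\mu(U) \subseteq U \subseteq H(U,x)$, so $\forall f \in \Gamma_x.\, ran(f) \cap H(U,x) \neq \emptyset$.

□ (Claim 4.2.24 (page 77))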

The following Proposition 4.2.25 (page 77) is the main positive result of Section 4.2.2.3 (page 70) and shows how to characterize smooth structures in the absence of closure under finite unions. The strategy of the proof follows closely the proof of Proposition 3.3.4 in [Sch04].

Proposition 4.2.25

Let $\mu : \mathcal{Y} \to \mathcal{P}(Z)$. Then there is a $\mathcal{Y}$-smooth preferential structure $\mathcal{Z}$, s.t. for all $X \in \mathcal{Y}$ $\mu(X) = \mu_{\mathcal{Z}}(X)$ iff $\mu$ satisfies $(\mu \subseteq)$ and $(HU,u)$ above.
Proof

"$\Rightarrow$": $(HU,u)$ was shown in Fact 4.2.22 (page 75).

Outline of "$\Leftarrow$": We first define a structure $\mathcal{Z}$ which represents $\mu$, but is not necessarily $\mathcal{Y}$-smooth, refine it to $\mathcal{Z}'$ and show that $\mathcal{Z}'$ represents $\mu$ too, and that $\mathcal{Z}'$ is $\mathcal{Y}$-smooth.

In the structure Z' , all pairs destroying smoothness in Z are successively repaired, by adding minimal elements: If (y, j) is 
not minimal, and has no minimal (x, i) below it, we just add one such (x, i). As the repair process might itself generate such 
"bad" pairs, the process may have to be repeated infinitely often. Of course, one has to take care that the representation 
property is preserved. 

Construction 4.2.3 

(Construction of Z) 

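Let $\mathcal{X} := \{\langle x, g\rangle : x \in K, g \in \Gamma_x\}$, $\langle x', g'\rangle \prec \langle x, g\rangle :\Leftrightarrow x' \in ran(g)$, $\mathcal{Z} := \langle \mathcal{X}, \prec\rangle$.
Claim 4.2.26

$\forall U \in \mathcal{Y}.\, \mu(U) = \mu_{\mathcal{Z}}(U)$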

Proof 

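Case 1: $x \notin K$. Then $x \notin \mu(U)$ and $x \notin \mu_{\mathcal{Z}}(U)$.
Case 2: $x \in K$.

By Claim 4.2.24 (page 77), (1) it suffices to show that for all $U \in \mathcal{Y}$ $x \in \mu_{\mathcal{Z}}(U) \Leftrightarrow x \in U \wedge \exists f \in \Gamma_x.\, ran(f) \cap U = \emptyset$. Fix $U \in \mathcal{Y}$.

"$\Rightarrow$": $x \in \mu_{\mathcal{Z}}(U) \Rightarrow$ there exists $\langle x, f\rangle$ minimal in $\mathcal{X} \lceil U$, thus $x \in U$ and there is no $\langle x', f'\rangle \prec \langle x, f\rangle$, $x' \in U$, $x' \in K$. But if $x' \in K$, then by Remark 4.2.23 (page 77), (1), $\Gamma_{x'} \neq \emptyset$, so we find suitable $f'$. Thus, $\forall x' \in ran(f).\, x' \notin U$ or $x' \notin K$. But $ran(f) \subseteq K$, so $ran(f) \cap U = \emptyset$.

"$\Leftarrow$": If $x \in U$, $f \in \Gamma_x$ s.t. $ran(f) \cap U = \emptyset$, then $\langle x, f\rangle$ is minimal in $\mathcal{X} \lceil U$. □ (Claim 4.2.26)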

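On a finite domain, Construction 4.2.3 and Claim 4.2.26 can be tried out directly. The sketch below builds the points $\langle x, g\rangle$ and computes $\mu_{\mathcal{Z}}$. As a simplifying assumption it stores only $ran(g)$ instead of $g$ itself, since the relation depends on $g$ only through its range. The function names and the ranked test structure are illustrative, not taken from the text.

```python
from itertools import combinations, product

def construct_Z(mu):
    """Points <x, ran(g)> for x in K and g a choice function on W_x."""
    K = set().union(*mu.values())
    points = set()
    for x in K:
        # W_x: all mu(Y) with x in Y - mu(Y)
        W_x = {mY for Y, mY in mu.items() if x in Y and x not in mY}
        # One point per choice function g; if W_x is empty, the only g is empty.
        for g in product(*(sorted(mY) for mY in W_x)):
            points.add((x, frozenset(g)))
    return points

def mu_Z(points, U):
    """Elements of U with a copy that is minimal among the copies lying in U."""
    inside = [(x, r) for (x, r) in points if x in U]
    # <x2, r2> is below <x, r> iff x2 is in r = ran(g).
    return frozenset(x for (x, r) in inside
                     if not any(x2 in r for (x2, _) in inside))

# Illustrative mu: the chain c < b < a on all non-empty subsets of {a, b, c}.
rank = {"c": 0, "b": 1, "a": 2}
mu = {}
for n in (1, 2, 3):
    for Y in map(frozenset, combinations("abc", n)):
        low = min(rank[y] for y in Y)
        mu[Y] = frozenset(y for y in Y if rank[y] == low)

points = construct_Z(mu)
```

In this example every $\Gamma_x$ has exactly one element, so the structure has three points, and $\mu_{\mathcal{Z}}$ agrees with $\mu$ on all seven sets of the domain.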


We will use in the construction of the refined structure Z' the following definition: 
Definition 4.2.6 

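$\sigma$ is called $x$-admissible sequence iff

1. $\sigma$ is a sequence of length $\leq \omega$, $\sigma = \{\sigma_i : i \in \omega\}$,

2. $\sigma_0 \in \Pi\{\mu(Y) : Y \in \mathcal{Y} \wedge x \in Y - \mu(Y)\}$,

3. $\sigma_{i+1} \in \Pi\{\mu(X) : X \in \mathcal{Y} \wedge x \in \mu(X) \wedge ran(\sigma_i) \cap X \neq \emptyset\}$.

By 2., $\sigma_0$ minimizes $x$, and by 3., if $x \in \mu(X)$, and $ran(\sigma_i) \cap X \neq \emptyset$, i.e. we have destroyed minimality of $x$ in $X$, $x$ will be above some $y$ minimal in $X$ to preserve smoothness.

Let $\Sigma_x$ be the set of $x$-admissible sequences; for $\sigma \in \Sigma_x$ let $\overbrace{\sigma} := \bigcup\{ran(\sigma_i) : i \in \omega\}$.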

Construction 4.2.4 

(Construction of Z') 

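Note that by Remark 4.2.23 (page 77), (1), $\Sigma_x \neq \emptyset$, if $x \in K$ (this does $\sigma_0$; $\sigma_{i+1}$ is trivial as by prerequisite $\mu(X) \neq \emptyset$).
Let $\mathcal{X}' := \{\langle x, \sigma\rangle : x \in K \wedge \sigma \in \Sigma_x\}$ and $\langle x', \sigma'\rangle \prec' \langle x, \sigma\rangle :\Leftrightarrow x' \in \overbrace{\sigma}$. Finally, let $\mathcal{Z}' := \langle \mathcal{X}', \prec'\rangle$, and $\mu' := \mu_{\mathcal{Z}'}$.

It is now easy to show that $\mathcal{Z}'$ represents $\mu$, and that $\mathcal{Z}'$ is smooth. For $x \in \mu(U)$, we construct a special $x$-admissible sequence $\sigma^{x,U}$ using the properties of $H(U,x)$ as described at the beginning of this section.

Claim 4.2.27

For all $U \in \mathcal{Y}$ $\mu(U) = \mu_{\mathcal{Z}}(U) = \mu'(U)$.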
Proof 

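If $x \notin K$, then $x \notin \mu_{\mathcal{Z}}(U)$, and $x \notin \mu'(U)$ for any $U$. So assume $x \in K$. If $x \in U$ and $x \notin \mu_{\mathcal{Z}}(U)$, then for all $\langle x, f\rangle \in \mathcal{X}$, there is $\langle x', f'\rangle \in \mathcal{X}$ with $\langle x', f'\rangle \prec \langle x, f\rangle$ and $x' \in U$. Let now $\langle x, \sigma\rangle \in \mathcal{X}'$, then $\langle x, \sigma_0\rangle \in \mathcal{X}$, and let $\langle x', f'\rangle \prec \langle x, \sigma_0\rangle$ in $\mathcal{Z}$ with $x' \in U$. As $x' \in K$, $\Sigma_{x'} \neq \emptyset$, let $\sigma' \in \Sigma_{x'}$. Then $\langle x', \sigma'\rangle \prec' \langle x, \sigma\rangle$ in $\mathcal{Z}'$. Thus $x \notin \mu'(U)$. Thus, for all $U \in \mathcal{Y}$, $\mu'(U) \subseteq \mu_{\mathcal{Z}}(U) = \mu(U)$.

It remains to show $x \in \mu(U) \Rightarrow x \in \mu'(U)$.

Assume $x \in \mu(U)$ (so $x \in K$), $U \in \mathcal{Y}$; we will construct minimal $\sigma$, i.e. show that there is $\sigma^{x,U} \in \Sigma_x$ s.t. $\overbrace{\sigma^{x,U}} \cap U = \emptyset$. We construct this $\sigma^{x,U}$ inductively, with the stronger property that $ran(\sigma_i^{x,U}) \cap H(U,x) = \emptyset$ for all $i \in \omega$.

$x \in \mu(U)$, $x \in Y - \mu(Y) \Rightarrow \mu(Y) - H(U,x) \neq \emptyset$ by $(HU,u)$ for $H(U,x)$. Let $\sigma_0^{x,U} \in \Pi\{\mu(Y) - H(U,x) : Y \in \mathcal{Y}, x \in Y - \mu(Y)\}$, so $ran(\sigma_0^{x,U}) \cap H(U,x) = \emptyset$.

By the induction hypothesis, $ran(\sigma_i^{x,U}) \cap H(U,x) = \emptyset$. Let $X \in \mathcal{Y}$ be s.t. $x \in \mu(X)$, $ran(\sigma_i^{x,U}) \cap X \neq \emptyset$. Thus $X \not\subseteq H(U,x)$, so $\mu(X) - H(U,x) \neq \emptyset$ by Fact 4.2.19 (page 75), (1). Let $\sigma_{i+1}^{x,U} \in \Pi\{\mu(X) - H(U,x) : X \in \mathcal{Y}, x \in \mu(X), ran(\sigma_i^{x,U}) \cap X \neq \emptyset\}$, so $ran(\sigma_{i+1}^{x,U}) \cap H(U,x) = \emptyset$. The construction satisfies the $x$-admissibility condition. □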

It remains to show: 

Claim 4.2.28 

Z' is y~ smooth. 

Proof 

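Let $X \in \mathcal{Y}$, $\langle x, \sigma\rangle \in \mathcal{X}' \lceil X$.

Case 1, $x \in X - \mu(X)$: Then $ran(\sigma_0) \cap \mu(X) \neq \emptyset$, let $x' \in ran(\sigma_0) \cap \mu(X)$. Moreover, $\mu(X) \subseteq K$. Then for all $\langle x', \sigma'\rangle \in \mathcal{X}'$ $\langle x', \sigma'\rangle \prec \langle x, \sigma\rangle$. But $\langle x', \sigma^{x',X}\rangle$ as constructed in the proof of Claim 4.2.27 (page 78) is minimal in $\mathcal{X}' \lceil X$.

Case 2, $x \in \mu(X) = \mu_{\mathcal{Z}}(X) = \mu'(X)$: If $\langle x, \sigma\rangle$ is minimal in $\mathcal{X}' \lceil X$, we are done. So suppose there is $\langle x', \sigma'\rangle \prec \langle x, \sigma\rangle$, $x' \in X$. Thus $x' \in \overbrace{\sigma}$. Let $x' \in ran(\sigma_i)$. So $x \in \mu(X)$ and $ran(\sigma_i) \cap X \neq \emptyset$. But $\sigma_{i+1} \in \Pi\{\mu(X') : X' \in \mathcal{Y} \wedge x \in \mu(X') \wedge ran(\sigma_i) \cap X' \neq \emptyset\}$, so $X$ is one of the $X'$, moreover $\mu(X) \subseteq K$, so there is $x'' \in \mu(X) \cap ran(\sigma_{i+1}) \cap K$, so for all $\langle x'', \sigma''\rangle \in \mathcal{X}'$ $\langle x'', \sigma''\rangle \prec \langle x, \sigma\rangle$. But again $\langle x'', \sigma^{x'',X}\rangle$ as constructed in the proof of Claim 4.2.27 (page 78) is minimal in $\mathcal{X}' \lceil X$.

□ (Claim 4.2.28 and Proposition 4.2.25 (page 77))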

We conclude this section by showing that we cannot improve substantially. 
Proposition 4.2.29 

There is no fixed size characterization of $\mu$-functions which are representable by smooth structures, if the domain is not closed under finite unions.

Proof

Suppose we have a fixed size characterization, which allows to distinguish $\mu$-functions on domains which are not necessarily closed under finite unions, and which can be represented by smooth structures, from those which cannot be represented in this way. Let the characterization have $\alpha$ parameters for sets, and consider Example 4.2.4 (page 71) with $\kappa = \beta + 1$, $\beta > \alpha$ (as a cardinal). This structure cannot be represented, as $(\mu Cum\kappa)$ fails - see Fact 4.2.18, (2.1). As we have only $\alpha$ parameters, at least one of the $X_\gamma$ is not mentioned, say $X_\delta$. Without loss of generality, we may assume that $\delta = \delta' + 1$. We change now the structure, and erase one pair of the relation, $x_\delta \prec x_{\delta+1}$. Thus, $\mu(X_\delta) = \{c, x_\delta, x_{\delta+1}\}$. But now we cannot go any more from $X_{\delta'}$ to $X_{\delta'+1} = X_\delta$, as $\mu(X_\delta) \not\subseteq X_{\delta'}$. Consequently, the only chain showing that $(\mu Cum\infty)$ fails is interrupted - and we have added no new possibilities, as inspection of cases shows. ($x_{\delta+1}$ is now globally minimal, and increasing $\mu(X)$ cannot introduce new chains, only interrupt chains.) Thus, $(\mu Cum\infty)$ holds in the modified example, and it is thus representable by a smooth structure, as the above proposition shows. As we did not touch any of the parameters, the truth value of the characterization is unchanged, which was negative. So the "characterization" cannot be correct. □



4.2.2.4 The transitive smooth case 

Unfortunately, $(\mu Cumt\infty)$ is a necessary but not sufficient condition for smooth transitive structures, as can be seen in the following example.

Example 4.2.5

We assume no closure whatever.
$U := \{u_1, u_2, u_3, u_4\}$, $\mu(U) := \{u_3, u_4\}$
$Y_1 := \{u_4, v_1, v_2, v_3, v_4\}$, $\mu(Y_1) := \{v_3, v_4\}$
$Y_{2,1} := \{u_2, v_2, v_4\}$, $\mu(Y_{2,1}) := \{u_2, v_2\}$
$Y_{2,2} := \{u_1, v_1, v_3\}$, $\mu(Y_{2,2}) := \{u_1, v_1\}$

For no $A$, $B$ $\mu(A) \subseteq B$ ($A \neq B$), so the prerequisite of $(\mu Cumt\alpha)$ is false, and $(\mu Cumt\alpha)$ holds, but there is no smooth transitive representation possible: Consider $Y_1$. If $u_4 \succ v_3$, then $Y_{2,2}$ makes this impossible; if $u_4 \succ v_4$, then $Y_{2,1}$ makes this impossible.
Remark 4.2.30

(1) The situation does not change when we have copies, the same argument will still work: There is a $U$-minimal copy $\langle u_4, i\rangle$; by smoothness and $Y_1$, there must be a $Y_1$-minimal copy, e.g. $\langle v_3, j\rangle \prec \langle u_4, i\rangle$. By smoothness and $Y_{2,2}$, there must be a $Y_{2,2}$-minimal $\langle u_1, k\rangle$ or $\langle v_1, l\rangle$ below $\langle v_3, j\rangle$. But $v_1$ is in $Y_1$, contradicting minimality of $\langle v_3, j\rangle$; $u_1$ is in $U$, contradicting minimality of $\langle u_4, i\rangle$ by transitivity. If we choose $\langle v_4, j\rangle$ minimal below $\langle u_4, i\rangle$, we will work with $Y_{2,1}$ instead of $Y_{2,2}$.

(2) We can also close under arbitrary intersections, and the example will still work: We have to consider $U \cap Y_1$, $U \cap Y_{2,1}$, $U \cap Y_{2,2}$, $Y_{2,1} \cap Y_{2,2}$, $Y_1 \cap Y_{2,1}$, $Y_1 \cap Y_{2,2}$; there are no further intersections to consider. We may assume $\mu(A) = A$ for all these intersections (working with copies). But then $\mu(A) \subseteq B$ implies $\mu(A) = A$ for all sets, and all $(\mu Cumt\alpha)$ hold again trivially.

(3) If we had finite unions, we could form $A := U \cup Y_1 \cup Y_{2,1} \cup Y_{2,2}$, then $\mu(A)$ would have to be a subset of $\{u_3\}$ by $(\mu PR)$, so by $(\mu CUM)$ $u_4 \notin \mu(U)$, a contradiction. Finite unions allow us to "look ahead"; without $(\cup)$, we see disaster only at the end - and have to backtrack, i.e. try in our example $Y_{2,1}$, once we have seen impossibility via $Y_{2,2}$, and discover impossibility again at the end. □
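Two of the finite computations in Example 4.2.5 and Remark 4.2.30 (3) can be checked mechanically: that no $\mu(A)$ is included in a different set $B$ of the domain, and that $(\mu PR)$ leaves only $u_3$ as a possible element of $\mu(A)$ for the union $A$. The variable names in the sketch below are illustrative assumptions.

```python
U   = frozenset({"u1", "u2", "u3", "u4"})
Y1  = frozenset({"u4", "v1", "v2", "v3", "v4"})
Y21 = frozenset({"u2", "v2", "v4"})
Y22 = frozenset({"u1", "v1", "v3"})

mu = {
    U:   {"u3", "u4"},
    Y1:  {"v3", "v4"},
    Y21: {"u2", "v2"},
    Y22: {"u1", "v1"},
}

# The prerequisite mu(A) ⊆ B never holds for A != B.
no_inclusion = not any(mu[A] <= B for A in mu for B in mu if A != B)

# Remark (3): for the union A, (muPR) requires mu(A) ∩ Y ⊆ mu(Y) for each Y,
# so x can be in mu(A) only if x is minimal in every Y that contains it.
A = U | Y1 | Y21 | Y22
candidates = {x for x in A if all(x in mu[Y] for Y in mu if x in Y)}
```

This covers only the arithmetic of the remark. The impossibility of a smooth transitive representation is the argument given in the text, not something the sketch searches for.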



4.2.2.5 General ranked structures 

Fix $f : \mathcal{Y} \to \mathcal{P}(Z)$.

4.2.2.5.2 The general case We summarize in the following Lemma 4.2.31 (page 80) our results for the general ranked case, many of them trivial.

Lemma 4.2.31 

We assume here for simplicity that all elements occur in the structure. 

(1) If $\mu(X) = \emptyset$, then each element $x \in X$ either has infinitely many copies, or below each copy of each $x$, there is an infinite descending chain of other elements.

(2) If there is no $X$ s.t. $x \in \mu(X)$, then we can make infinitely many copies of $x$.

(3) There is no simple way to detect whether there is for all $x$ some $X$ s.t. $x \in \mu(X)$. More precisely: there is no normal finite characterization of ranked structures, in which each $x$ in the domain occurs in at least one $\mu(X)$.

Suppose in the sequel that for each $x$ there is some $X$ s.t. $x \in \mu(X)$. (This is the hard case.)

(4) If the language is finite, then $X \neq \emptyset$ implies $\mu(X) \neq \emptyset$.
Suppose now the language to be infinite.

(5) If we admit all theories, then $\mu(M(T)) = M(T)$ for all complete theories.

(6) It is possible to have $\mu(M(\phi)) = \emptyset$ for all formulas $\phi$, even though all models occur in exactly one copy.

(7) If the domain is sufficiently rich, then we cannot have $\mu(X) = \emptyset$ for "many" $X$.

(8) We see that a small domain (see Case (6)) can have many $X$ with $\mu(X) = \emptyset$, but if the domain is too dense (see Case (7)), then we cannot have many $\mu(X) = \emptyset$. (We do not know any criterion to distinguish poor from rich domains.)

(9) If we have all pairs in the domain, we can easily construct the ranking.
Proof 

(1), (2), (4), (5), (9) are trivial, there is nothing to show for (8). 

(3) Suppose there is a normal characterization $\Phi$ of such structures, where each element $x$ occurs at least once in a set $X$ s.t. $x \in \mu(X)$. Such a characterization will be a finite boolean combination of set expressions $\Phi$, universally quantified, in the spirit of (AND), (RM) etc.

We consider a realistic counterexample - an infinite propositional language and the sets definable by formulas. We do not necessarily assume definability preservation, and work with full equality of results.

Take an infinite propositional language $p_i : i < \omega$. Choose an arbitrary model $m$, say $m \models p_i : i < \omega$.

Now, determine the height of any model $m'$ as follows: $ht(m') :=$ the first $p_i$ s.t. $m(p_i) \neq m'(p_i)$, in our example then the first $p_i$ s.t. $m' \models \neg p_i$. Thus, only $m$ has infinite height; essentially, the more different $m'$ is from $m$ (in an alphabetical order), the lower it is.

Make now $\omega$ many copies of $m$, in infinite descending order, which you put on top of the rest.

$\Phi$ has to fail for some instantiation, as $X$ does not have the desired property. Write this instantiation of $\Phi$ wlog. as a disjunction of conjunctions: $\bigvee(\bigwedge \phi_{i,j})$.

Each (consistent, or non-empty) component $\phi_{i,j}$ has finite height, more precisely: the minimum of all heights of its models (which is a finite height). Thus, $\mu(\phi_{i,j})$ will be just the minimally high models of $\phi_{i,j}$ in this order.

Modify now $X$ s.t. $m$ has only 1 copy, and is just (+1 suffices) above the minimum of all the finitely many $\phi_{i,j}$. Then none
(Remark: Obviously, there are two easy generalizations for this ranking: First, we can go beyond $\omega$ (but also stay below $\omega$); second, instead of taking just one $m$ as a scale, and which has maximal height, we can take a set $M$ of models: $ht(m')$ is then the first $p_i$ where $m'(p_i)$ is different from all $m \in M$. Note that in this case, in general, not all levels need to be filled. If e.g., $m_0, m_1 \in M$, and $m_0(p_0) = false$, $m_1(p_0) = true$, then level 0 will be empty.)
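The height function, including its generalization to a set $M$ of scale models, is easy to experiment with once the language is cut down to finitely many variables. The truncation to `N` variables, the tuple encoding of models, and the name `ht` as a Python function are assumptions of this sketch; in the text the language is infinite and only $m$ has infinite height.

```python
from itertools import product

def ht(model, scale):
    """Index of the first variable on which `model` differs from every model in `scale`.

    Returns None if no such index exists among the variables considered
    (the finite stand-in for "infinite height").
    """
    for i, value in enumerate(model):
        if all(m[i] != value for m in scale):
            return i
    return None

N = 4                                   # truncation: p_0, ..., p_3 only
all_models = list(product([True, False], repeat=N))

# Single scale model m making every p_i true.
m = (True,) * N
heights = {mp: ht(mp, [m]) for mp in all_models}

# Scale set M = {m0, m1} with m0(p_0) = false and m1(p_0) = true.
m0 = (False, True, True, True)
m1 = (True, True, True, True)
levels_M = {ht(mp, [m0, m1]) for mp in all_models}
```

With the two scale models disagreeing on $p_0$, no model can differ from both at $p_0$, which is why level 0 stays empty.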

(6) Let the $p_i$ again define an infinite language. Denote by $p_i^+$ the set of all $+p_j$, where $j > i$. Let $T$ be the usual tree of models (each model is a branch) for the $p_i$, with an artificial root $*$. Let the first model (= branch) be $*^+$, i.e. the leftmost branch in the obvious way of drawing it. Next, we choose $\neg p_0^+$, i.e. we go right, and then all the way left. Next, we consider the 4 sequences of $+/-p_0$, $+/-p_1$; two of them were done already, both ending in $p_1^+$, and choose the remaining two, both ending in $\neg p_1^+$, i.e. the positive prolongations of $p_0, \neg p_1$ and $\neg p_0, \neg p_1$. Thus, at each level, we take all possible prolongations; the positive ones were done already, and we count those which begin negatively, and then continue positively. Each formula has in this counting arbitrarily big models.

This is not yet a full enumeration of all models, e.g. the branch with all models negative will never be enumerated. But it suffices for our purposes.

Reverse the order so far constructed, and put the models not enumerated on top. Then all models are considered, and each formula has arbitrarily small models, thus $\mu(\phi) = \emptyset$ for all $\phi$.

(7) Let the domain contain all singletons, and let the structure be without copies. The latter can be seen by considering singletons. Suppose now there is a set $X$ in the domain s.t. $\mu(X) = \emptyset$. Thus, each $x \in X$ must have infinitely many $x' \in X$, $x' \prec x$. Suppose $\mathcal{P}(X)$ is a subset of the domain. Then there must be infinite $Y \in \mathcal{P}(X)$ s.t. $\mu(Y) \neq \emptyset$: Suppose not. Let $\prec$ be the ranking order. Choose arbitrary $x \in X$. Consider $X' := \{x' \in X : x \prec x'\}$, then $x \in \mu(X')$, and not all such $X'$ can be finite - assuming $X$ is big enough, e.g. uncountable.

□ 



We conclude by giving an example of a definability preserving non-compact preferential logic - in answer to a question by 
D.Makinson (personal communication): 

Example 4.2.6 

Take an infinite language, $p_i$, $i < \omega$. Fix one model, $m$, which makes $p_0$ true (and, say, for definiteness, all the others true, too), and $m'$ which is just like $m$, but it makes $p_0$ false. Well-order all the other $p_0$-models, and all the other $\neg p_0$-models separately.

Construct now the following ranked structure:

On top, put $m$, directly below it $m'$. Further down put the block of the other $\neg p_0$-models, and at the bottom the block of the other $p_0$-models.

As the structure is well-ordered, it is definability preserving (singletons are definable).
Let $T$ be the theory defined by $m, m'$, then $T \mathrel{|\!\sim} \neg p_0$.

Let $\phi$ be s.t. $M(T) \subseteq M(\phi)$, then $M(\phi)$ contains a $p_0$-model other than $m$, so $\phi \mathrel{|\!\sim} p_0$.
□ 



4.2.2.6 Smooth ranked structures 

We assume that all elements occur in the structure, so smoothness and $\mu(X) \neq \emptyset$ for $X \neq \emptyset$ coincide.

The following abstract definition is motivated by:

$\approx(u)$, the set of $u' \in W$ which have same rank as $u$,

$\prec(u)$, the set of $u' \in W$ which have lower rank than $u$,

$\succ(u)$, the set of $u' \in W$ which have higher rank than $u$,

all other $u' \in W$ will by default have unknown rank in comparison.

We can diagnose e.g. $u' \in \approx(u)$ if $u, u' \in \mu(X)$ for some $X$, and $u' \in \succ(u)$ if $u \in \mu(X)$ and $u' \in X - \mu(X)$ for some $X$.

If we sometimes do not know more, we will have to consider also $\preceq(u)$ and $\succeq(u)$ - this will be needed in Section 8.1.2 (page 153), where we will have only incomplete information, due to hidden dimensions.

Definition 4.2.7 

(1) Define for each $u \in W$ three subsets of $W$: $\approx(u)$, $\prec(u)$, and $\succ(u)$. Let $\mathcal{O}$ be the set of all these subsets, i.e. $\mathcal{O} := \{\approx(u), \prec(u), \succ(u) : u \in W\}$

(2) We say that $\mathcal{O}$ is generated by a choice function $f$
iff

(i) $\forall U \in \mathcal{Y}\ \forall x, x' \in f(U)$ $x' \in \approx(x)$,
(3) $\mathcal{O}$ is said to be representable by a ranking iff there is a function $f : W \to \langle O, \lhd\rangle$ into a total order $\langle O, \lhd\rangle$ s.t.

(1) $u' \in \approx(u) \Rightarrow f(u') = f(u)$

(2) $u' \in \prec(u) \Rightarrow f(u') \lhd f(u)$

(3) $u' \in \succ(u) \Rightarrow f(u') \rhd f(u)$

(4) Let $C(\mathcal{O})$ be the closure of $\mathcal{O}$ under the following operations:

• $u \in \approx(u)$,

• if $u' \in \approx(u)$, then $\approx(u) = \approx(u')$, $\prec(u) = \prec(u')$, $\succ(u) = \succ(u')$,

• $u' \in \prec(u)$ iff $u \in \succ(u')$,

• $u \in \prec(u')$, $u' \in \prec(u'') \Rightarrow u \in \prec(u'')$,

or, equivalently,

• $u \in \succ(u') \Rightarrow \prec(u') \subseteq \prec(u)$.

Note that we will generally lose much ignorance in applying the next two Facts.
Fact 4.2.32 

A partial (strict) order on W can be extended to a total (strict) order. 
Proof 

Take an arbitrary enumeration of all pairs $a, b$ of $W$: $(a, b)_i : i \in \kappa$. Suppose all $(a, b)_j$ for $j < i$ have been ordered, and we have no information if $a \prec b$ or $a \approx b$ or $a \succ b$. Choose arbitrarily $a \prec b$. A contradiction would be a (finite) cycle involving $\prec$. But then we would have known already that $b \preceq a$. □
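For a finite $W$, an extension as in Fact 4.2.32 can be produced by topological sorting instead of the pair-by-pair enumeration of the proof. The sketch below uses `graphlib.TopologicalSorter` from the Python standard library (Python 3.9 or later). The function name `linear_extension` and the encoding of the strict order as a set of pairs are illustrative assumptions, and the sketch says nothing about infinite $W$.

```python
from graphlib import TopologicalSorter

def linear_extension(elements, below):
    """Return the elements as a list, lowest first, respecting the partial order.

    `below` is a set of pairs (a, b) meaning a is strictly below b.
    A cycle in `below` makes static_order() raise graphlib.CycleError.
    """
    sorter = TopologicalSorter({e: set() for e in elements})
    for a, b in below:
        sorter.add(b, a)          # a must be placed before b
    return list(sorter.static_order())

order = linear_extension("abcd", {("a", "b"), ("c", "d"), ("a", "d")})
position = {e: i for i, e in enumerate(order)}
```

Any two elements are then comparable via their positions, and every pair of the original order is preserved.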

We use now a generalized abstract nonsense result, taken from [LMS01], which must also be part of the folklore:
Fact 4.2.33 

Given a set $X$ and a binary relation $R$ on $X$, there exists a total preorder (i.e. a total, reflexive, transitive relation) $S$ on $X$ that extends $R$ such that

$\forall x, y \in X\,(xSy, ySx \Rightarrow xR^*y)$

where $R^*$ is the reflexive and transitive closure of $R$.
Proof

Define $x \equiv y$ iff $xR^*y$ and $yR^*x$. The relation $\equiv$ is an equivalence relation. Let $[x]$ be the equivalence class of $x$ under $\equiv$. Define $[x] \preceq [y]$ iff $xR^*y$. The definition of $\preceq$ does not depend on the representatives $x$ and $y$ chosen. The relation $\preceq$ on equivalence classes is a partial order.

Let $\leq$ be any total order on these equivalence classes that extends $\preceq$, by Fact 4.2.32 above.

Define $xSy$ iff $[x] \leq [y]$. The relation $S$ is total (since $\leq$ is total) and transitive (since $\leq$ is transitive) and is therefore a total preorder. It extends $R$ by the definition of $\preceq$ and the fact that $\leq$ extends $\preceq$. Suppose now $xSy$ and $ySx$. We have $[x] \leq [y]$ and $[y] \leq [x]$ and therefore $[x] = [y]$ by antisymmetry. Therefore $x \equiv y$ and $xR^*y$. □
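The proof of Fact 4.2.33 is constructive in the finite case and can be followed step by step: compute $R^*$, form the equivalence classes, order the classes totally, and read off $S$. The sketch below does this with a Warshall-style closure and a topological sort of the classes. The function name `total_preorder` and the return of $R^*$ alongside $S$ are illustrative choices of this sketch.

```python
from graphlib import TopologicalSorter
from itertools import product

def total_preorder(X, R):
    """Return (S, R_star): a total preorder S extending R, and the closure R*."""
    X = list(X)
    # Reflexive and transitive closure R* (Warshall: intermediate node k outermost).
    star = {(x, x) for x in X} | set(R)
    for k, i, j in product(X, X, X):
        if (i, k) in star and (k, j) in star:
            star.add((i, j))
    # Equivalence classes [x] of x ≡ y iff x R* y and y R* x.
    cls = {x: frozenset(y for y in X if (x, y) in star and (y, x) in star)
           for x in X}
    # Total order on the classes extending [x] ⪯ [y] iff x R* y.
    sorter = TopologicalSorter({c: set() for c in cls.values()})
    for x, y in star:
        if cls[x] != cls[y]:
            sorter.add(cls[y], cls[x])      # [x] is placed before [y]
    rank = {c: i for i, c in enumerate(sorter.static_order())}
    S = {(x, y) for x in X for y in X if rank[cls[x]] <= rank[cls[y]]}
    return S, star

X = [1, 2, 3, 4]
R = {(1, 2), (2, 1), (3, 4)}
S, R_star = total_preorder(X, R)
```

Here 1 and 2 fall into one class, while 3 and 4 stay in separate classes with 3 below 4. How the class of 1 and 2 is placed relative to the others is left to the sorter, which reflects the freedom in choosing the total order.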



Fact 4.2.34 

$\mathcal{O}$ can be represented by a ranking iff in $C(\mathcal{O})$ the sets $\approx(u)$, $\prec(u)$, $\succ(u)$ are pairwise disjoint.
Proof

(Outline) By the construction of $C(\mathcal{O})$ and disjointness, there are no cycles involving $\prec$. Extend the relation by Fact 4.2.33 (page 82). Let the $\approx(u)$ be the equivalence classes. Define $\approx(u) \preceq \approx(u')$ iff $u \in \preceq(u')$. □



Proposition 4.2.35

Let $f : \mathcal{Y} \to \mathcal{P}(W)$. $f$ is representable by a smooth ranked structure iff in $C(\mathcal{O})$ the sets $\approx(u)$, $\prec(u)$, $\succ(u)$ are pairwise disjoint, where $\mathcal{O}$ is the system generated by $f$, as in Definition 4.2.7 (page 81).



Proof

If the sets are not pairwise disjoint, we have a cycle. If not, use Fact 4.2.34 (page 82). □
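For a finite choice function, the cycle condition behind this criterion can be tested directly. The sketch below derives rank information from $f$ in the way described in the motivation of Definition 4.2.7 (elements of $f(U)$ have equal rank, and elements of $U - f(U)$ lie strictly above those of $f(U)$), closes it under transitivity, and reports a conflict when some element would have to be both strictly below and not above another. This is an illustrative reformulation under those assumptions, not the closure $C(\mathcal{O})$ itself, and the function name is hypothetical.

```python
from itertools import product

def representable_by_ranking(f):
    """Check that the rank constraints read off from f contain no strict cycle."""
    W = set().union(*f.keys())
    leq = {(w, w) for w in W}     # (x, y): x is known to be at most as high as y
    strict = set()                # (x, y): x is known to be strictly below y
    for U, chosen in f.items():
        for x in chosen:
            for y in chosen:
                leq.add((x, y))   # same rank, so both directions end up in leq
            for y in U - chosen:
                strict.add((x, y))
                leq.add((x, y))
    # Transitive closure of leq (Warshall: intermediate node k outermost).
    for k, i, j in product(W, W, W):
        if (i, k) in leq and (k, j) in leq:
            leq.add((i, j))
    # A conflict is a strict pair x < y together with y <= x.
    return not any((y, x) in leq for (x, y) in strict)

ranked = {frozenset("ab"): frozenset("a"), frozenset("abc"): frozenset("a")}
cyclic = {frozenset("ab"): frozenset("a"), frozenset("abc"): frozenset("b")}
mixed  = {frozenset("ab"): frozenset("ab"), frozenset("abc"): frozenset("a")}
```

In `cyclic`, $a$ is put below $b$ by one set and $b$ below $a$ by the other. In `mixed`, $a$ and $b$ get equal rank from one set while $b$ is put strictly above $a$ by the other, so the sets $\approx(a)$ and $\succ(a)$ would overlap.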



Chapter 5 

Preferential structures - Part II 



5.1 Simplifications by domain conditions, logical properties 

5.1.1 Introduction 

We examine here simplifications made possible by stronger closure conditions of the domain $\mathcal{Y}$, in particular $(\cup)$.

For general preferential structures, there is nothing to show - there were no prerequisites about closure of the domain.

The smooth case is more interesting. The work for the not necessarily transitive case was done already, and, as we did not know how to do better, we give now directly the result for the smooth transitive case, using in an essential way $(\cup)$.

5.1.2 Smooth structures 

For completeness' sake and for the reader's convenience, we will just repeat here our result from [Sch04], with the slight improvement that we do not need $(\cap)$ any more, and the codomain need not be $\mathcal{Y}$ any more. The central condition is, of course, $(\cup)$, which we use now as we prepare the classical propositional case, where we have $\vee$.

5.1.2.0.1 Discussion of the smooth and transitive case 

In a certain way, it is not surprising that transitivity does not impose stronger conditions in the smooth case either. 
Smoothness is itself a weak kind of transitivity: If an element is not minimal, then there is a minimal element below it, 
i.e., $x \succ y$ with $y$ not minimal is possible, there is $z' \prec y$, but then there is $z$ minimal with $x \succ z$. This is "almost" $x \succ z'$, transitivity.

To obtain representation, we will combine here the ideas of the smooth, but not necessarily transitive case with those of the 
general transitive case - as the reader will have suspected. Thus, we will index again with trees, and work with (suitably 
adapted) admissible sequences for the construction of the trees. In the construction of the admissible sequences, we were 
careful to repair all damage done in previous steps. We have to add now reparation of all damage done by using transitivity, 
i.e., the transitivity of the relation might destroy minimality, and we have to construct minimal elements below all elements 
for which we thus destroyed minimality. Both cases are combined by considering immediately all $Y$ s.t. $x \in Y \not\subseteq H(U)$. Of course, the properties described in Fact 4.2.20 (page 75) play again a central role.

The (somewhat complicated) construction will be commented on in more detail below. 

Note that even beyond Fact 4.2.20 (page 75), closure of the domain under finite unions is used in the construction of
the trees. This - or something like it - is necessary, as we have to respect the hulls of all elements treated so far (the 
predecessors), and not only of the first element, because of transitivity. For the same reason, we need more bookkeeping, 
to annotate all the hulls (or the union of the respective U's) of all predecessors to be respected. One can perhaps do with 
a weaker operation than union - i.e. just look at the hulls of all U's separately, to obtain a transitive construction where 
unions are lacking, see the case of plausibility logic below - but we have not investigated this problem. 

To summarize: we combine the ideas from the transitive general case and the simple smooth case, using the crucial Fact
4.2.20 (page 75) to show that the construction goes through. The construction leaves still some freedom, and modifications
are possible as indicated below in the course of the proof. The construction is perhaps the most complicated in the entire 
book, as it combines several ideas, some of which are already somewhat involved. If necessary, the proof can certainly still 
be elaborated, and its main points (use of a suitable H(U) to avoid U, successive repair of damage done in the construction, 
trees as indexing) may probably be used in other contexts, too. 

5.1.2.0.2 The construction: 

Recall that $\mathcal{Y}$ will be closed under finite unions in this section, and let again $\mu : \mathcal{Y} \to \mathcal{P}(Z)$.
Proposition 5.1.1 (page 83) is the representation result for the smooth transitive case.

Proposition 5.1.1 

Let $\mathcal{Y}$ be closed under finite unions, and $\mu : \mathcal{Y} \to \mathcal{P}(Z)$. Then there is a $\mathcal{Y}$-smooth transitive preferential structure $\mathcal{Z}$, s.t. for all $X \in \mathcal{Y}$ $\mu(X) = \mu_{\mathcal{Z}}(X)$ iff $\mu$ satisfies $(\mu \subseteq)$, $(\mu PR)$, $(\mu CUM)$.
5.1.2.0.3 The idea:

We have to adapt Construction 4.2.4 (page 78) (using $x$-admissible sequences) to the transitive situation, and to our construction with trees. If $\langle \emptyset, x\rangle$ is the root, $\sigma_0 \in \Pi\{\mu(Y) : x \in Y - \mu(Y)\}$ determines some children of the root. To preserve smoothness, we have to compensate and add other children by the $\sigma_{i+1}$: $\sigma_{i+1} \in \Pi\{\mu(X) : x \in \mu(X), ran(\sigma_i) \cap X \neq \emptyset\}$. On the other hand, we have to pursue the same construction for the children so constructed. Moreover, these indirect children have to be added to those children of the root, which have to be compensated (as the first children are compensated by $\sigma_1$) to preserve smoothness. Thus, we build the tree in a simultaneous vertical and horizontal induction.

This construction can be simplified, by considering immediately all $Y \in \mathcal{Y}$ s.t. $x \in Y \not\subseteq H(U)$ - independent of whether $x \notin \mu(Y)$ (as done in $\sigma_0$), or whether $x \in \mu(Y)$, and some child $y$ constructed before is in $Y$ (as done in the $\sigma_{i+1}$), or whether $x \in \mu(Y)$, and some indirect child $y$ of $x$ is in $Y$ (to take care of transitivity, as indicated above). We make this simplified construction.

There are two ways to proceed. First, we can take as $\rhd^*$ in the trees the transitive closure of $\rhd$. Second, we can deviate 
from the idea that children are chosen by selection functions $f$, and take nonempty subsets of elements instead, making 
more elements children than in the first case. We take the first alternative, as it is more in the spirit of the construction. 

We will suppose for simplicity that $Z = K$ - the general case is easy to obtain by a technique similar to that in Section 
4.2.2.1, but complicates the picture. 

For each $x \in Z$, we construct trees $t_x$, which will be used to index different copies of $x$, and control the relation $\prec$. 
These trees $t_x$ will have the following form: 

(a) the root of $t$ is $\langle\emptyset, x\rangle$ or $\langle U, x\rangle$ with $U \in \mathcal{Y}$ and $x \in \mu(U)$, 

(b) all other nodes are pairs $\langle Y, y\rangle$, $Y \in \mathcal{Y}$, $y \in \mu(Y)$, 

(c) $ht(t) \le \omega$, 

(d) if $\langle Y, y\rangle$ is an element in $t_x$, then there is some $\mathcal{Y}(y) \subseteq \{W \in \mathcal{Y} : y \in W\}$, and $f \in \Pi\{\mu(W) : W \in \mathcal{Y}(y)\}$ s.t. the set of 
children of $\langle Y, y\rangle$ is $\{\langle Y \cup W, f(W)\rangle : W \in \mathcal{Y}(y)\}$. 

The first coordinate is used for bookkeeping when constructing children, in particular for condition (d). 
The relation $\prec$ will essentially be determined by the subtree relation. 

We first construct the trees $t_x$ for those sets $U$ where $x \in \mu(U)$, and then take care of the others. In the construction for 
the minimal elements, at each level $n > 0$, we may have several ways to choose a selection function $f_n$, and each such choice 
leads to the construction of a different tree - we construct all these trees. (We could also construct only one tree, but then 
the choice would have to be made coherently for different $x, U$. It is simpler to construct more trees than necessary.) 
We control the relation by indexing with trees, just as it was done in the not necessarily smooth case before. 

Definition 5.1.1 

If t is a tree with root (a, b), then t/c will be the same tree, only with the root (c, b). 

Construction 5.1.1 

(A) The set $T_x$ of trees $t$ for fixed $x$: 

(1) Construction of the set $T\mu_x$ of trees for those sets $U \in \mathcal{Y}$ where $x \in \mu(U)$: 

Let $U \in \mathcal{Y}$, $x \in \mu(U)$. The trees $t_{U,x} \in T\mu_x$ are constructed inductively, observing simultaneously: 

If $\langle U_{n+1}, x_{n+1}\rangle$ is a child of $\langle U_n, x_n\rangle$, then 

(a) $x_{n+1} \in \mu(U_{n+1}) - H(U_n)$, and (b) $U_n \subseteq U_{n+1}$. 

Set $U_0 := U$, $x_0 := x$. 

Level 0: $\langle U_0, x_0\rangle$. 

Level $n \to n+1$: Let $\langle U_n, x_n\rangle$ be in level $n$. Suppose $Y_{n+1} \in \mathcal{Y}$, $x_n \in Y_{n+1}$, and $Y_{n+1} \not\subseteq H(U_n)$. Note that $\mu(U_n \cup Y_{n+1}) - 
H(U_n) \neq \emptyset$ by Fact 4.2.20 (page 75), (5.5), and $\mu(U_n \cup Y_{n+1}) - H(U_n) \subseteq \mu(Y_{n+1})$ by Fact 4.2.20 (page 75), (3.3). Choose 
$f_{n+1} \in \Pi\{\mu(U_n \cup Y_{n+1}) - H(U_n) : Y_{n+1} \in \mathcal{Y}, x_n \in Y_{n+1} \not\subseteq H(U_n)\}$ (for the construction of this tree, at this element), and 
let the set of children of $\langle U_n, x_n\rangle$ be $\{\langle U_n \cup Y_{n+1}, f_{n+1}(Y_{n+1})\rangle : Y_{n+1} \in \mathcal{Y}, x_n \in Y_{n+1} \not\subseteq H(U_n)\}$. (If there is no such $Y_{n+1}$, 
$\langle U_n, x_n\rangle$ has no children.) Obviously, (a) and (b) hold. 

We call such trees $U,x$-trees. 

(2) Construction of the set $T'_x$ of trees for the nonminimal elements. Let $x \in Z$. Construct the tree $t_x$ as follows (here, one 
tree per $x$ suffices for all $U$): 

Level 0: $\langle\emptyset, x\rangle$ 

Level 1: Choose arbitrary $f \in \Pi\{\mu(U) : x \in U \in \mathcal{Y}\}$. Note that $U \neq \emptyset \to \mu(U) \neq \emptyset$ by $Z = K$ (see the corresponding Remark, 
(1)). Let $\{\langle U, f(U)\rangle : x \in U \in \mathcal{Y}\}$ be the set of children of $\langle\emptyset, x\rangle$. This assures that the element will be nonminimal. 

Level > 1: Let $\langle U, f(U)\rangle$ be an element of level 1; as $f(U) \in \mu(U)$, there is a $t_{U,f(U)} \in T\mu_{f(U)}$. Graft one of these trees 
$t_{U,f(U)} \in T\mu_{f(U)}$ at $\langle U, f(U)\rangle$ on the level 1. This assures that a minimal element will be below it to guarantee smoothness. 

Finally, let $T_x := T\mu_x \cup T'_x$. 

(B) The relation $\rhd$ between trees: For $x, y \in Z$, $t \in T_x$, $t' \in T_y$, set $t \rhd t'$ iff for some $Y$, $\langle Y, y\rangle$ is a child of the root $\langle X, x\rangle$ 
in $t$, and $t'$ is the subtree of $t$ beginning at this $\langle Y, y\rangle$. 

(C) The structure $\mathcal{Z}$: 

The rest of the proof consists of simple observations. 
Fact 5.1.2 

(1) If $t_{U,x}$ is a $U,x$-tree, $\langle U_n, x_n\rangle$ an element of $t_{U,x}$, $\langle U_m, x_m\rangle$ a direct or indirect child of $\langle U_n, x_n\rangle$, then $x_m \notin H(U_n)$. 

(2) Let $\langle Y_n, y_n\rangle$ be an element in $t_{U,x} \in T\mu_x$, $t'$ the subtree starting at $\langle Y_n, y_n\rangle$, then $t'$ is a $Y_n,y_n$-tree. 

(3) $\prec$ is free from cycles. 

(4) If $t_{U,x}$ is a $U,x$-tree, then $\langle x, t_{U,x}\rangle$ is $\prec$-minimal in $\mathcal{Z}\lceil U$. 

(5) No $\langle x, t_x\rangle$, $t_x \in T'_x$, is minimal in any $\mathcal{Z}\lceil U$, $U \in \mathcal{Y}$. 

(6) Smoothness is respected for the elements of the form $\langle x, t_{U,x}\rangle$. 

(7) Smoothness is respected for the elements of the form $\langle x, t_x\rangle$ with $t_x \in T'_x$. 

(8) $\mu = \mu_{\mathcal{Z}}$. 

Proof 

(1) trivial by (a) and (b). 

(2) trivial by (a). 

(3) Note that no $\langle x, t_x\rangle$, $t_x \in T'_x$, can be smaller than any other element (smaller elements require $U \neq \emptyset$ at the root). So 
no cycle involves any such $\langle x, t_x\rangle$. Consider now $\langle x, t_{U,x}\rangle$, $t_{U,x} \in T\mu_x$. For any $\langle y, t_{V,y}\rangle \prec \langle x, t_{U,x}\rangle$, $y \notin H(U)$ by (1), but 
$x \in \mu(U) \subseteq H(U)$, so $x \neq y$. 

(4) This is trivial by (1). 

(5) Let $x \in U \in \mathcal{Y}$, then $f$ as used in the construction of level 1 of $t_x$ chooses $y \in \mu(U) \neq \emptyset$, and some $\langle y, t_{U,y}\rangle$ is in $\mathcal{Z}\lceil U$ 
and below $\langle x, t_x\rangle$. 

(6) Let $x \in A \in \mathcal{Y}$, we have to show that either $\langle x, t_{U,x}\rangle$ is minimal in $\mathcal{Z}\lceil A$, or that there is $\langle y, t_y\rangle \prec \langle x, t_{U,x}\rangle$ minimal in 
$\mathcal{Z}\lceil A$. Case 1, $A \subseteq H(U)$: Then $\langle x, t_{U,x}\rangle$ is minimal in $\mathcal{Z}\lceil A$, again by (1). Case 2, $A \not\subseteq H(U)$: Then $A$ is one of the $Y_1$ 
considered for level 1. So there is $\langle U \cup A, f_1(A)\rangle$ in level 1 with $f_1(A) \in \mu(A) \subseteq A$ by Fact 4.2.20 (page 75), (3.3). But note 
that by (1) all elements below $\langle U \cup A, f_1(A)\rangle$ avoid $H(U \cup A)$. Let $t$ be the subtree of $t_{U,x}$ beginning at $\langle U \cup A, f_1(A)\rangle$, then 
by (2) $t$ is one of the $U \cup A, f_1(A)$-trees, and $\langle f_1(A), t\rangle$ is minimal in $\mathcal{Z}\lceil U \cup A$ by (4), so in $\mathcal{Z}\lceil A$, and $\langle f_1(A), t\rangle \prec \langle x, t_{U,x}\rangle$. 

(7) Let $x \in A \in \mathcal{Y}$, $\langle x, t_x\rangle$, $t_x \in T'_x$, and consider the subtree $t$ beginning at $\langle A, f(A)\rangle$, then $t$ is one of the $A, f(A)$-trees, 
and $\langle f(A), t\rangle$ is minimal in $\mathcal{Z}\lceil A$ by (4). 

(8) Let $x \in \mu(U)$. Then any $\langle x, t_{U,x}\rangle$ is $\prec$-minimal in $\mathcal{Z}\lceil U$ by (4), so $x \in \mu_{\mathcal{Z}}(U)$. Conversely, let $x \in U - \mu(U)$. By (5), 
no $\langle x, t_x\rangle$ is minimal in $U$. Consider now some $\langle x, t_{V,x}\rangle \in \mathcal{Z}$, so $x \in \mu(V)$. As $x \in U - \mu(U)$, $U \not\subseteq H(V)$ by Fact 4.2.20 
(page 75), (5.4). Thus $U$ was considered in the construction of level 1 of $t_{V,x}$. Let $t$ be the subtree of $t_{V,x}$ beginning at 
$\langle V \cup U, f_1(U)\rangle$; by $\mu(V \cup U) - H(V) \subseteq \mu(U)$ (Fact 4.2.20 (page 75), (3.3)), $f_1(U) \in \mu(U) \subseteq U$, and $\langle f_1(U), t\rangle \prec \langle x, t_{V,x}\rangle$. 

□ (Fact 5.1.2 and Proposition 5.1.1 (page 83)) 



5.1.3 Ranked structures 

We summarize for completeness' sake results from [Sch04]: 

First two results for the case without copies (Proposition 5.1.3 and Proposition 5.1.4). 
Proposition 5.1.3 

Let $\mathcal{Y} \subseteq \mathcal{P}(U)$ be closed under finite unions. Then $(\mu\subseteq)$, $(\mu\emptyset)$, $(\mu=)$ characterize ranked structures for which for all 
$X \in \mathcal{Y}$, $X \neq \emptyset \Rightarrow \mu_<(X) \neq \emptyset$ hold, i.e. $(\mu\subseteq)$, $(\mu\emptyset)$, $(\mu=)$ hold in such structures for $\mu_<$, and if they hold for some $\mu$, we 
can find a ranked relation $<$ on $U$ s.t. $\mu = \mu_<$. Moreover, the structure can be chosen $\mathcal{Y}$-smooth. 

For the following representation result, we assume only $(\mu\emptyset fin)$, but the domain has to contain singletons. 

Proposition 5.1.4 

Let $\mathcal{Y} \subseteq \mathcal{P}(U)$ be closed under finite unions, and contain singletons. Then $(\mu\subseteq)$, $(\mu\emptyset fin)$, $(\mu=)$, $(\mu\in)$ characterize 
ranked structures for which for all finite $X \in \mathcal{Y}$, $X \neq \emptyset \Rightarrow \mu_<(X) \neq \emptyset$ hold, i.e. $(\mu\subseteq)$, $(\mu\emptyset fin)$, $(\mu=)$, $(\mu\in)$ hold in such 
structures for $\mu_<$, and if they hold for some $\mu$, we can find a ranked relation $<$ on $U$ s.t. $\mu = \mu_<$. 

Note that the prerequisites of Proposition 5.1.4 hold in particular in the case of ranked structures without copies, 
where all elements of $U$ are present in the structure - we need infinite descending chains to have $\mu(X) = \emptyset$ for $X \neq \emptyset$. 

We turn now to the general case, where every element may occur in several copies. 
Fact 5.1.5 

(1) $(\mu\subseteq) + (\mu PR) + (\mu=) + (\mu\cup) + (\mu\in)$ do not imply representation by a ranked structure. 
$(\mu\parallel\infty)$ $\mu(\bigcup\{A_i : i \in I\}) = \bigcup\{\mu(A_i) : i \in I'\}$ for some $I' \subseteq I$ 
will not always hold in ranked structures. 

We assume again the existence of singletons for the following representation result. 
Proposition 5.1.6 

Let $\mathcal{Y}$ be closed under finite unions and contain singletons. Then $(\mu\subseteq) + (\mu PR) + (\mu\parallel) + (\mu\cup) + (\mu\in)$ characterize ranked 
structures. 

5.1.4 The logical properties with definability preservation 

We repeat for completeness' sake: 
Proposition 5.1.7 

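Let $\mid\!\sim$ be a logic for $\mathcal{L}$. Set $T^{\mathcal{M}} := Th(\mu_{\mathcal{M}}(M(T)))$, where $\mathcal{M}$ is a preferential structure. 

(1) Then there is a (transitive) definability preserving classical preferential model $\mathcal{M}$ s.t. $\overline{\overline{T}} = T^{\mathcal{M}}$ iff 

(LLE) $\overline{T} = \overline{T'} \Rightarrow \overline{\overline{T}} = \overline{\overline{T'}}$, 

(CCL) $\overline{\overline{T}}$ is classically closed, 

(SC) $T \subseteq \overline{\overline{T}}$, 

(PR) $\overline{\overline{T \cup T'}} \subseteq \overline{\overline{\overline{T}} \cup T'}$ 

for all $T, T' \subseteq \mathcal{L}$. 

(2) The structure can be chosen smooth, iff, in addition 

(CUM) $T \subseteq \overline{T'} \subseteq \overline{\overline{T}} \Rightarrow \overline{\overline{T}} = \overline{\overline{T'}}$ 

holds. 

The proof is an immediate consequence of Proposition 5.1.8 (page 86) and Proposition 5.1.1 (page 83). □ 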
Proposition 5.1.8 

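Consider for a logic $\mid\!\sim$ on $\mathcal{L}$ the properties 

(LLE) $\overline{T} = \overline{T'} \Rightarrow \overline{\overline{T}} = \overline{\overline{T'}}$, 

(CCL) $\overline{\overline{T}}$ is classically closed, 

(SC) $T \subseteq \overline{\overline{T}}$, 

(PR) $\overline{\overline{T \cup T'}} \subseteq \overline{\overline{\overline{T}} \cup T'}$, 

(CUM) $T \subseteq \overline{T'} \subseteq \overline{\overline{T}} \Rightarrow \overline{\overline{T}} = \overline{\overline{T'}}$ 

for all $T, T' \subseteq \mathcal{L}$, 

and for a function $\mu : D_{\mathcal{L}} \to \mathcal{P}(M_{\mathcal{L}})$ the properties 

$(\mu dp)$ $\mu$ is definability preserving, i.e. $\mu(M(T)) = M(T')$ for some $T'$, 

$(\mu\subseteq)$ $\mu(X) \subseteq X$, 

$(\mu PR)$ $X \subseteq Y \Rightarrow \mu(Y) \cap X \subseteq \mu(X)$, 

$(\mu CUM)$ $\mu(X) \subseteq Y \subseteq X \Rightarrow \mu(X) = \mu(Y)$ 

for all $X, Y \in D_{\mathcal{L}}$. 
It then holds: 

(a) If $\mu$ satisfies $(\mu dp)$, $(\mu\subseteq)$, $(\mu PR)$, then $\mid\!\sim$ defined by $\overline{\overline{T}} := T^{\mu} := Th(\mu(M(T)))$ satisfies (LLE), (CCL), (SC), (PR). If 
$\mu$ satisfies in addition $(\mu CUM)$, then (CUM) will hold, too. 

(b) If $\mid\!\sim$ satisfies (LLE), (CCL), (SC), (PR), then there is $\mu : D_{\mathcal{L}} \to \mathcal{P}(M_{\mathcal{L}})$ s.t. $\overline{\overline{T}} = T^{\mu}$ for all $T \subseteq \mathcal{L}$ and $\mu$ satisfies 
$(\mu dp)$, $(\mu\subseteq)$, $(\mu PR)$. If, in addition, (CUM) holds, then $(\mu CUM)$ will hold, too. 

The proof follows from Proposition 2.3.4 (page 36). □ 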
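
On a finite domain, the conditions $(\mu\subseteq)$, $(\mu PR)$ and $(\mu CUM)$ can be checked by brute force. The following Python sketch is only an illustration and not part of the original development: all helper names (`powerset`, `mu_from_relation`, `satisfies_subset`, `satisfies_pr`, `satisfies_cum`) are hypothetical, and the structures are assumed to be finite and without copies, so that definability preservation plays no role.

```python
from itertools import combinations


def powerset(base):
    """Return all subsets of `base` as frozensets (the assumed finite domain)."""
    items = sorted(base)
    return [frozenset(c) for r in range(len(items) + 1)
            for c in combinations(items, r)]


def mu_from_relation(domain, smaller):
    """mu(X) = elements of X with nothing smaller inside X.

    `smaller` is a set of pairs (y, x), read as "y is preferred to x".
    """
    return {X: frozenset(x for x in X
                         if not any((y, x) in smaller for y in X))
            for X in domain}


def satisfies_subset(mu):
    """(mu subseteq): mu(X) is a subset of X."""
    return all(mu[X] <= X for X in mu)


def satisfies_pr(mu):
    """(mu PR): X subseteq Y implies mu(Y) & X subseteq mu(X)."""
    return all(mu[Y] & X <= mu[X] for X in mu for Y in mu if X <= Y)


def satisfies_cum(mu):
    """(mu CUM): mu(X) subseteq Y subseteq X implies mu(X) == mu(Y)."""
    return all(mu[X] == mu[Y] for X in mu for Y in mu if mu[X] <= Y <= X)


# A transitive, cycle-free (hence smooth) order a < b < c.
chain = mu_from_relation(powerset("abc"), {("a", "b"), ("b", "c"), ("a", "c")})
# A two-element cycle: nothing is minimal in {a, b}, so smoothness fails.
cycle = mu_from_relation(powerset("ab"), {("a", "b"), ("b", "a")})
```

In this sketch the chain satisfies all three conditions, while the cycle keeps $(\mu\subseteq)$ and $(\mu PR)$ but loses $(\mu CUM)$, in line with the role of smoothness in part (2) of Proposition 5.1.7.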



5.2 $\mathcal{A}$-ranked structures 

We now do the completeness proofs for the preferential part of hierarchical conditionals. All motivation etc. will be found 
in the section on hierarchical conditionals. 

First the basic semantical definition: 
Definition 5.2.1 

Let $A$ be a fixed set, and $\mathcal{A}$ a finite, totally ordered (by $<$) disjoint cover by non-empty subsets of $A$. 

For $x \in A$, let $rg(x)$ be the unique $A' \in \mathcal{A}$ such that $x \in A'$, so $rg(x) < rg(y)$ is defined in the natural way. 

A preferential structure $\langle\mathcal{X}, \prec\rangle$ ($\mathcal{X}$ a set of pairs $\langle x, i\rangle$) is called $\mathcal{A}$-ranked iff for all $x, x'$, $rg(x) < rg(x')$ implies $\langle x, i\rangle \prec 
\langle x', i'\rangle$ for all $\langle x, i\rangle, \langle x', i'\rangle \in \mathcal{X}$. 

5.2.1 Representation results for $\mathcal{A}$-ranked structures 
5.2.1.1 Discussion 

The not necessarily smooth and the smooth case will be treated differently. 

Strangely, the smooth case is simpler, as an added new layer in the proof settles it. Yet, this is not surprising when looking 
closer, as minimal elements never have higher rank, and we know from $(\mu CUM)$ that minimizing by minimal elements 
suffices. All we have to add is that any element in the minimal layer minimizes any element higher up. 

In the simple, not necessarily smooth, case, we have to go deeper into the original proof to obtain the result. 

The following idea, inspired by the treatment of the smooth case, will not work: Instead of minimizing by arbitrary 
elements, minimize only by elements of minimal rank, as the following example shows. If it worked, we might add just 
another layer to the original proof without $(\mu\mathcal{A})$ (see Definition 5.2.2 (page 87)), as in the smooth case. 

Example 5.2.1 

Consider the base set $\{a, b, c\}$, $\mu(\{a, b, c\}) = \{b\}$, $\mu(\{a, b\}) = \{a, b\}$, $\mu(\{a, c\}) = \emptyset$, $\mu(\{b, c\}) = \{b\}$, $\mathcal{A}$ defined by $\{a, b\} < \{c\}$. 
Obviously, $(\mu\mathcal{A})$ is satisfied, $\mu$ can be represented by the (not transitive!) relation $a \prec c \prec a$, $b \prec c$, which is $\mathcal{A}$-ranked. 
But trying to minimize $a$ in $\{a, b, c\}$ in the minimal layer will lead to $b \prec a$, and thus $a \notin \mu(\{a, b\})$, which is wrong. 
□ 
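
Example 5.2.1 is small enough to be replayed mechanically. The Python sketch below is an illustration only: it assumes a structure without copies, encodes the two layers as ranks 0 and 1, and uses hypothetical helper names (`minimal`, `represents`, `is_a_ranked`).

```python
def minimal(X, smaller):
    """Elements of X that have no smaller element inside X."""
    return frozenset(x for x in X if not any((y, x) in smaller for y in X))


# Data of Example 5.2.1: layers {a, b} < {c}, encoded as ranks 0 and 1.
rank = {"a": 0, "b": 0, "c": 1}
target = {
    frozenset("abc"): frozenset("b"),
    frozenset("ab"): frozenset("ab"),
    frozenset("ac"): frozenset(),
    frozenset("bc"): frozenset("b"),
}
# The relation a < c < a, b < c, written as pairs (smaller, larger).
relation = {("a", "c"), ("c", "a"), ("b", "c")}


def represents(smaller):
    """True iff the minimal elements agree with the target on every set."""
    return all(minimal(X, smaller) == chosen for X, chosen in target.items())


def is_a_ranked(smaller):
    """True iff each lower-layer element is below each higher-layer element."""
    return all((x, y) in smaller
               for x in rank for y in rank if rank[x] < rank[y])


# Minimizing a "in the minimal layer" forces b < a, which breaks mu({a, b}).
forced = relation | {("b", "a")}
```

Under these assumptions `relation` represents the given $\mu$ and is $\mathcal{A}$-ranked, while `forced` no longer represents $\mu$, because $a$ is lost from the minimal elements of $\{a, b\}$.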



The proofs of the general and transitive general case are (minor) adaptations of the proofs in Section 4.2.2 (page 66). 
For the smooth case, we only have to add a supplementary layer in the end (Fact 5.2.8 (page 89)), which will make the 
construction $\mathcal{A}$-ranked. 

In the following, we will assume the partition $\mathcal{A}$ to be given. We could also construct it from the properties of $\mu$, but this 
would need stronger closure properties of the domain. The construction of $\mathcal{A}$ is more difficult than the construction of the 
ranking in fully ranked structures, as $x \in \mu(X)$, $y \in X - \mu(X)$ will guarantee only $rg(x) \le rg(y)$, and not $rg(x) < rg(y)$, 
as is the case in the latter situation. This corresponds to the separate treatment of the $\alpha$ and other formulas in the logical 
version, discussed in Section 5.2.1.4 (page 90). 

5.2.1.2 $\mathcal{A}$-ranked general and transitive structures 

5.2.1.2.1 Introduction We will show here the following representation result: 
Let $\mathcal{A}$ be given. 

An operation $\mu : \mathcal{Y} \to \mathcal{P}(Z)$ is representable by an $\mathcal{A}$-ranked preferential structure iff $\mu$ satisfies $(\mu\subseteq)$, $(\mu PR)$, $(\mu\mathcal{A})$ 
(Proposition 5.2.2), and, moreover, the structure can be chosen transitive (Proposition 5.2.4). 

Note that we carefully avoid any unnecessary assumptions about the domain $\mathcal{Y} \subseteq \mathcal{P}(Z)$ of the function $\mu$. 

Definition 5.2.2 

We define a new condition: 

Let $\mathcal{A}$ be given as defined in Definition 5.2.1 (page 87). 

$(\mu\mathcal{A})$ If $X \in \mathcal{Y}$, $A, A' \in \mathcal{A}$, $A < A'$, $X \cap A \neq \emptyset$, $X \cap A' \neq \emptyset$, then $\mu(X) \cap A' = \emptyset$. 
This new condition will be central for the modified representation. 
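
For a finite choice function, $(\mu\mathcal{A})$ can be tested directly from its definition. The following Python sketch is illustrative only: the function name `satisfies_mu_a` and the list encoding of the ordered cover are assumptions, and the test data are the values of $\mu$ from Example 5.2.1 together with one hypothetical violation.

```python
def satisfies_mu_a(mu, layers):
    """Check (mu A) for a finite choice function.

    `mu` maps frozensets X to frozensets mu(X); `layers` lists the members
    of the cover from lowest to highest.
    """
    for X, chosen in mu.items():
        for i, lower in enumerate(layers):
            for upper in layers[i + 1:]:
                # X meets both layers, so mu(X) must avoid the higher one.
                if X & lower and X & upper and chosen & upper:
                    return False
    return True


layers = [frozenset("ab"), frozenset("c")]  # {a, b} < {c}
mu = {
    frozenset("abc"): frozenset("b"),
    frozenset("ab"): frozenset("ab"),
    frozenset("ac"): frozenset(),
    frozenset("bc"): frozenset("b"),
}
# A hypothetical violation: c is chosen although b sits in a lower layer.
bad = dict(mu)
bad[frozenset("bc")] = frozenset("c")
```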

5.2.1.2.2 The basic, not necessarily transitive, case 
Corollary 5.2.1 

Let $\mu : \mathcal{Y} \to \mathcal{P}(Z)$ satisfy $(\mu\subseteq)$, $(\mu PR)$, $(\mu\mathcal{A})$, and let $U \in \mathcal{Y}$. 
If $x \in U$, and there is $x' \in U$ with $rg(x') < rg(x)$, then $\forall f \in \Pi_x . ran(f) \cap U \neq \emptyset$. 



88 CHAPTER 5. PREFERENTIAL STRUCTURES - PART II 

Proof 

By $(\mu\mathcal{A})$ $x \notin \mu(U)$, thus by Claim 4.2.11 (page 66) $\forall f \in \Pi_x . ran(f) \cap U \neq \emptyset$. □ 



Proposition 5.2.2 

Let $\mathcal{A}$ be given. 

An operation $\mu : \mathcal{Y} \to \mathcal{P}(Z)$ is representable by an $\mathcal{A}$-ranked preferential structure iff $\mu$ satisfies $(\mu\subseteq)$, $(\mu PR)$, $(\mu\mathcal{A})$. 
Proof 

One direction is trivial. The central argument is: If $a \prec b$ in $X$, and $X \subseteq Y$, then $a \prec b$ in $Y$, too. 

We turn to the other direction. The preferential structure is defined in Construction 5.2.1 (page 88), Claim 5.2.3 (page 
88) shows representation. 

Construction 5.2.1 

Let $\mathcal{X} := \{\langle x, f\rangle : x \in Z \wedge f \in \Pi_x\}$, and $\langle x', f'\rangle \prec \langle x, f\rangle :\leftrightarrow x' \in ran(f)$ or $rg(x') < rg(x)$. 
Note that, as $\mathcal{A}$ is given, we also know $rg(x)$. 
Let $\mathcal{Z} := \langle\mathcal{X}, \prec\rangle$. 

Obviously, $\mathcal{Z}$ is $\mathcal{A}$-ranked. 

Claim 5.2.3 

For $U \in \mathcal{Y}$, $\mu(U) = \mu_{\mathcal{Z}}(U)$. 
Proof 

By Claim 4.2.11 (page 66), it suffices to show that for all $U \in \mathcal{Y}$, $x \in \mu_{\mathcal{Z}}(U) \leftrightarrow x \in U$ and $\exists f \in \Pi_x . ran(f) \cap U = \emptyset$. So let 
$U \in \mathcal{Y}$. 

"$\to$": If $x \in \mu_{\mathcal{Z}}(U)$, then there is $\langle x, f\rangle$ minimal in $\mathcal{X}\lceil U$ (where $\mathcal{X}\lceil U := \{\langle x, i\rangle \in \mathcal{X} : x \in U\}$), so $x \in U$, and there is no 
$\langle x', f'\rangle \prec \langle x, f\rangle$, $x' \in U$, so by $\Pi_{x'} \neq \emptyset$ there is no $x' \in ran(f)$, $x' \in U$, but then $ran(f) \cap U = \emptyset$. 

"$\leftarrow$": If $x \in U$, and there is $f \in \Pi_x$, $ran(f) \cap U = \emptyset$, then by Corollary 5.2.1 (page 87), there is no $x' \in U$, $rg(x') < rg(x)$, 
so $\langle x, f\rangle$ is minimal in $\mathcal{X}\lceil U$. 

□ (Claim 5.2.3 (page 88) and Proposition 5.2.2) 



5.2.1.2.3 The transitive case 

Proposition 5.2.4 

Let $\mathcal{A}$ be given. 

An operation $\mu : \mathcal{Y} \to \mathcal{P}(Z)$ is representable by an $\mathcal{A}$-ranked transitive preferential structure iff $\mu$ satisfies $(\mu\subseteq)$, $(\mu PR)$, 
$(\mu\mathcal{A})$. 

Construction 5.2.2 

(1) For $x \in Z$, let $T_x$ be the set of trees $t_x$ s.t. 

(a) all nodes are elements of $Z$, 

(b) the root of $t_x$ is $x$, 

(c) $height(t_x) \le \omega$, 

(d) if $y$ is an element in $t_x$, then there is $f \in \Pi_y := \Pi\{Y \in \mathcal{Y} : y \in Y - \mu(Y)\}$ s.t. the set of children of $y$ is 
$ran(f) \cup \{y' \in Z : rg(y') < rg(y)\}$. 

(2) For $x, y \in Z$, $t_x \in T_x$, $t_y \in T_y$, set $t_x \rhd t_y$ iff $y$ is a (direct) child of the root $x$ in $t_x$, and $t_y$ is the subtree of $t_x$ beginning 
at $y$. 

(3) Let $\mathcal{Z} := \langle \{\langle x, t_x\rangle : x \in Z, t_x \in T_x\}, \langle x, t_x\rangle \succ \langle y, t_y\rangle$ iff $t_x \rhd t_y \rangle$. 
Fact 5.2.5 

(1) The construction ends at some $y$ iff $\mathcal{Y}_y = \emptyset$ and there is no $y'$ s.t. $rg(y') < rg(y)$, consequently $T_x = \{x\}$ iff $\mathcal{Y}_x = \emptyset$ 
and there are no $x'$ with lesser rank. (We identify the tree of height 1 with its root.) 

(2) We define a special tree $tc_x$ for all $x$: For all nodes $y$ in $tc_x$, the successors are as follows: if $\mathcal{Y}_y \neq \emptyset$, then $z$ is a 
successor iff $z = y$ or $rg(z) < rg(y)$; if $\mathcal{Y}_y = \emptyset$, then $z$ is a successor iff $rg(z) < rg(y)$. (In the first case, we make $f \in \Pi_y$ 
(3) If $f \in \Pi_x$, then the tree $tf_x$ with root $x$ and otherwise composed of the subtrees $tc_y$ for $y \in ran(f) \cup \{y'$ 
is an element of $T_x$. (Level 0 of $tf_x$ has $x$ as element, the $tc_y$'s begin at level 1.) 

(4) If $y$ is an element in $t_x$ and $t_y$ the subtree of $t_x$ starting at $y$, then $t_y \in T_y$. 

(5) $\langle x, t_x\rangle \succ \langle y, t_y\rangle$ implies $y \in ran(f) \cup \{x' : rg(x') < rg(x)\}$ for some $f \in \Pi_x$. 
□ 

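Claim 5.2.6 (page 89) shows basic representation. 
Claim 5.2.6 

Proof 

By Claim 4.2.11 (page 66), it suffices to show that for all $U \in \mathcal{Y}$, $x \in \mu_{\mathcal{Z}}(U) \leftrightarrow x \in U \wedge \exists f \in \Pi_x . ran(f) \cap U = \emptyset$. 

Fix $U \in \mathcal{Y}$. 

"$\to$": $x \in \mu_{\mathcal{Z}}(U) \to$ ex. $\langle x, t_x\rangle$ minimal in $\mathcal{Z}\lceil U$, thus $x \in U$ and there is no $\langle y, t_y\rangle \in \mathcal{Z}$, $\langle y, t_y\rangle \prec \langle x, t_x\rangle$, $y \in U$. Let $f$ 
define the first part of the set of children of the root $x$ in $t_x$. If $ran(f) \cap U \neq \emptyset$, if $y \in U$ is a child of $x$ in $t_x$, and if $t_y$ is the 
subtree of $t_x$ starting at $y$, then $t_y \in T_y$ and $\langle y, t_y\rangle \prec \langle x, t_x\rangle$, contradicting minimality of $\langle x, t_x\rangle$ in $\mathcal{Z}\lceil U$. So $ran(f) \cap U = \emptyset$. 

"$\leftarrow$": Let $x \in U$, and $\exists f \in \Pi_x . ran(f) \cap U = \emptyset$. By Corollary 5.2.1 (page 87), there is no $x' \in U$, $rg(x') < rg(x)$. If $\mathcal{Y}_x = \emptyset$, 
then the tree $tc_x$ has no $\rhd$-successors in $U$, and $\langle x, tc_x\rangle$ is $\succ$-minimal in $\mathcal{Z}\lceil U$. If $\mathcal{Y}_x \neq \emptyset$ and $f \in \Pi_x$ s.t. $ran(f) \cap U = \emptyset$, 
then $\langle x, tf_x\rangle$ is again $\succ$-minimal in $\mathcal{Z}\lceil U$. 

□ 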



We consider now the transitive closure of $\mathcal{Z}$. (Recall that $\prec^*$ denotes the transitive closure of $\prec$.) Claim 5.2.7 (page 89) 
shows that transitivity does not destroy what we have achieved. The trees $tf_x$ play a crucial role in the demonstration. 

Claim 5.2.7 

Let $\mathcal{Z}' := \langle \{\langle x, t_x\rangle : x \in Z, t_x \in T_x\}, \langle x, t_x\rangle \succ \langle y, t_y\rangle$ iff $t_x \rhd^* t_y \rangle$. Then $\mu_{\mathcal{Z}} = \mu_{\mathcal{Z}'}$. 
Proof 

Suppose there is $U \in \mathcal{Y}$, $x \in U$, $x \in \mu_{\mathcal{Z}}(U)$, $x \notin \mu_{\mathcal{Z}'}(U)$. Then there must be an element $\langle x, t_x\rangle \in \mathcal{Z}$ with no $\langle x, t_x\rangle \succ \langle y, t_y\rangle$ 
for any $y \in U$. Let $f \in \Pi_x$ determine the first part of the set of children of $x$ in $t_x$, then $ran(f) \cap U = \emptyset$, consider $tf_x$. 
All elements $w \neq x$ of $tf_x$ are already in $ran(f)$, or $rg(w) < rg(x)$ holds. (Note that the elements chosen by rank in 
$tf_x$ continue by themselves or by another element of even smaller rank, but the rank order is transitive.) But all $w$ s.t. 
$rg(w) < rg(x)$ were already successors at level 1 of $x$ in $tf_x$. By Corollary 5.2.1 (page 87), there is no $w \in U$, $rg(w) < rg(x)$. 
Thus, no element $\neq x$ of $tf_x$ is in $U$. Thus there is no $\langle z, t_z\rangle \prec^* \langle x, tf_x\rangle$ in $\mathcal{Z}$ with $z \in U$, so $\langle x, tf_x\rangle$ is minimal in $\mathcal{Z}'\lceil U$, 
contradiction. 

□ (Claim 5.2.7 (page 89) and Proposition 5.2.4) 



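5.2.1.3 $\mathcal{A}$-ranked smooth structures 

All smooth cases have a simple solution. We use one of our existing proofs for the not necessarily $\mathcal{A}$-ranked case, and add 
one little result: 

Fact 5.2.8 

Let $(\mu\mathcal{A})$ hold, and let $\mathcal{Z} = \langle\mathcal{X}, \prec\rangle$ be a smooth preferential structure representing $\mu$, i.e. $\mu = \mu_{\mathcal{Z}}$. 
Suppose that 

$\langle x, i\rangle \prec \langle y, j\rangle$ implies $rg(x) \le rg(y)$. 

Define $\mathcal{Z}' := \langle\mathcal{X}, \sqsubset\rangle$ where $\langle x, i\rangle \sqsubset \langle y, j\rangle$ iff $\langle x, i\rangle \prec \langle y, j\rangle$ or $rg(x) < rg(y)$. 

Then $\mathcal{Z}'$ is $\mathcal{A}$-ranked. 

$\mathcal{Z}'$ is smooth, too, and $\mu_{\mathcal{Z}} = \mu_{\mathcal{Z}'} =: \mu'$. 

In addition, if $\prec$ is free from cycles, so is $\sqsubset$; if $\prec$ is transitive, so is $\sqsubset$. 
Proof 

$\mathcal{A}$-rankedness is trivial. 

Suppose $\langle x, i\rangle$ is $\prec$-minimal, but not $\sqsubset$-minimal. Then there must be $\langle y, j\rangle \sqsubset \langle x, i\rangle$, $\langle y, j\rangle \not\prec \langle x, i\rangle$, $y \in X$, so 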




: rg(y') < rg(y)} 




By prerequisite, there cannot be any cycle involving only $\prec$, but the rank order is free from cycles, too, and $\prec$ respects the 
rank order, so $\sqsubset$ is free from cycles. 

Let $\prec$ be transitive, so is the rank order. But if $\langle x, i\rangle \prec \langle y, j\rangle$ and $rg(y) < rg(z)$ for some $\langle z, k\rangle$, then by prerequisite 
$rg(x) \le rg(y)$, so $rg(x) < rg(z)$, so $\langle x, i\rangle \sqsubset \langle z, k\rangle$ by definition. Likewise for $rg(x) < rg(y)$ and $\langle y, j\rangle \prec \langle z, k\rangle$. 

□ 
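
The passage from $\mathcal{Z}$ to $\mathcal{Z}'$ in Fact 5.2.8 can be illustrated on a small structure with copies. The Python sketch below is only a toy instance under stated assumptions: the element names, the ranks, and the helper names (`mu_copies`, `add_rank_order`, `is_a_ranked`) are hypothetical, and the structure is finite and cycle-free, hence smooth.

```python
def mu_copies(X, elements, smaller):
    """x is in mu(X) iff some copy (x, i) has no smaller copy (y, j), y in X."""
    inside = [p for p in elements if p[0] in X]
    return frozenset(p[0] for p in inside
                     if not any((q, p) in smaller for q in inside))


def add_rank_order(elements, smaller, rg):
    """Keep `smaller` and add (p, q) whenever p has strictly lower rank."""
    return set(smaller) | {(p, q) for p in elements for q in elements
                           if rg[p[0]] < rg[q[0]]}


def is_a_ranked(elements, smaller, rg):
    """Every copy of lower rank must be below every copy of higher rank."""
    return all((p, q) in smaller for p in elements for q in elements
               if rg[p[0]] < rg[q[0]])


# Toy structure: two copies of x (rank 0), one copy of y (rank 1).
rg = {"x": 0, "y": 1}
elements = [("x", 0), ("x", 1), ("y", 0)]
smaller = {(("x", 0), ("y", 0))}   # only one copy of x is below y
extended = add_rank_order(elements, smaller, rg)
domain = [frozenset("x"), frozenset("y"), frozenset("xy")]
```

In this instance the original relation respects the rank order but is not $\mathcal{A}$-ranked, since the second copy of $x$ is not below $y$; the extended relation is $\mathcal{A}$-ranked and yields the same minimal elements on every set of the domain.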

All that remains to show then is that our constructions of smooth and of smooth and transitive structures satisfy the 
condition 

$\langle x, i\rangle \prec \langle y, j\rangle$ implies $rg(x) \le rg(y)$. 

Proposition 5.2.9 

Let $\mathcal{A}$ be given. 

Let - for simplicity - $\mathcal{Y}$ be closed under finite unions, and $\mu : \mathcal{Y} \to \mathcal{P}(Z)$. Then there is a $\mathcal{Y}$-smooth $\mathcal{A}$-ranked preferential 
structure $\mathcal{Z}$, s.t. for all $X \in \mathcal{Y}$, $\mu(X) = \mu_{\mathcal{Z}}(X)$ iff $\mu$ satisfies $(\mu\subseteq)$, $(\mu PR)$, $(\mu CUM)$, $(\mu\mathcal{A})$. 

Proof 

Consider the construction in the proof of Proposition 4.2.25 (page 77). We have to show that it respects the rank order 
with respect to $\mathcal{A}$, i.e. that $\langle x', \sigma'\rangle \prec' \langle x, \sigma\rangle$ implies $rg(x') \le rg(x)$. This is easy: By definition, $x' \in \bigcup\{ran(\sigma_i) : i \in \omega\}$. If 
$x' \in ran(\sigma_0)$, then for some $Y$, $x' \in \mu(Y)$, $x \in Y - \mu(Y)$, so $rg(x') \le rg(x)$ by $(\mu\mathcal{A})$. If $x' \in ran(\sigma_i)$, $i > 0$, then for some 
$X$, $x', x \in \mu(X)$, so $rg(x) = rg(x')$ by $(\mu\mathcal{A})$. 

□ (Proposition 5.2.9 (page 90)) 



Proposition 5.2.10 

Let $\mathcal{A}$ be given. 

Let - for simplicity - $\mathcal{Y}$ be closed under finite unions, and $\mu : \mathcal{Y} \to \mathcal{P}(Z)$. Then there is a $\mathcal{Y}$-smooth $\mathcal{A}$-ranked transitive 
preferential structure $\mathcal{Z}$, s.t. for all $X \in \mathcal{Y}$, $\mu(X) = \mu_{\mathcal{Z}}(X)$ iff $\mu$ satisfies $(\mu\subseteq)$, $(\mu PR)$, $(\mu CUM)$, $(\mu\mathcal{A})$. 

Proof 

Consider the construction in the proof of Proposition 5.1.1 (page 83). 
Thus, we only have to show that in $\mathcal{Z}$ defined by 

$\mathcal{Z} := \langle \{\langle x, t_x\rangle : x \in Z, t_x \in T_x\}, \langle x, t_x\rangle \succ \langle y, t_y\rangle$ iff $t_x \rhd^* t_y \rangle$, $t_x \rhd t_y$ implies $rg(y) \le rg(x)$. 
But by construction of the trees, $x_n \in Y_{n+1}$, and $x_{n+1} \in \mu(U_n \cup Y_{n+1})$, so $rg(x_{n+1}) \le rg(x_n)$. 
□ (Proposition 5.2.10 (page 90)) 



5.2.1.4 The logical properties with definability preservation 

First, a small fact about the $\mathcal{A}$. 

Fact 5.2.11 

Let $\mathcal{A}$ be as above (and thus finite). Then each $A_i$ is equivalent to a formula $\alpha_i$. 
Proof 

We use the standard topology and its compactness. By definition, each $M(A_i)$ is closed, by finiteness all unions of such 
$M(A_i)$ are closed, too, so $C(M(A_i))$ is closed. By compactness, each open cover $X_j : j \in J$ of the clopen $M(A_i)$ contains 
a finite subcover, so also $\bigcup\{M(A_j) : j \neq i\}$ has a finite open cover. But the $M(\phi)$, $\phi$ a formula, form a basis of the closed 
sets, so we are done. □ 

Proposition 5.2.12 

Let $\mid\!\sim$ be a logic for $\mathcal{L}$. Set $T^{\mathcal{M}} := Th(\mu_{\mathcal{M}}(M(T)))$, where $\mathcal{M}$ is a preferential structure. 

(1) Then there is a (transitive) definability preserving classical preferential model $\mathcal{M}$ s.t. $\overline{\overline{T}} = T^{\mathcal{M}}$ iff 
(LLE), (CCL), (SC), (PR) hold for all $T, T' \subseteq \mathcal{L}$. 

(2) The structure can be chosen smooth, iff, in addition 
(3) The structure can be chosen $\mathcal{A}$-ranked, iff, in addition 

($\mathcal{A}$-min) $T \not\vdash \neg\alpha_i$ and $T \not\vdash \neg\alpha_j$, $i < j$ implies $\overline{\overline{T}} \vdash \neg\alpha_j$ 
holds. 
The proof is an immediate consequence of Proposition 5.2.13 (page 91) and the respective above results. This proposition 
(or its analogue) was mostly already shown in [Sch92] and [Sch96-1] and is repeated here for completeness' sake, but with 
a new and partly stronger proof. 

Proposition 5.2.13 

Consider for a logic $\mid\!\sim$ on $\mathcal{L}$ the properties 

(LLE), (CCL), (SC), (PR), (CUM), ($\mathcal{A}$-min) hold for all $T, T' \subseteq \mathcal{L}$. 

and for a function $\mu : D_{\mathcal{L}} \to \mathcal{P}(M_{\mathcal{L}})$ the properties 

$(\mu dp)$ $\mu$ is definability preserving, i.e. $\mu(M(T)) = M(T')$ for some $T'$ 

$(\mu\subseteq)$, $(\mu PR)$, $(\mu CUM)$, $(\mu\mathcal{A})$ 

for all $X, Y \in D_{\mathcal{L}}$. 

It then holds: 

(a) If $\mu$ satisfies $(\mu dp)$, $(\mu\subseteq)$, $(\mu PR)$, then $\mid\!\sim$ defined by $\overline{\overline{T}} := T^{\mu} := Th(\mu(M(T)))$ satisfies (LLE), (CCL), (SC), (PR). If 
$\mu$ satisfies in addition $(\mu CUM)$, then (CUM) will hold, too. If $\mu$ satisfies in addition $(\mu\mathcal{A})$, then ($\mathcal{A}$-min) will hold, too. 

(b) If $\mid\!\sim$ satisfies (LLE), (CCL), (SC), (PR), then there is $\mu : D_{\mathcal{L}} \to \mathcal{P}(M_{\mathcal{L}})$ s.t. $\overline{\overline{T}} = T^{\mu}$ for all $T \subseteq \mathcal{L}$ and $\mu$ satisfies 
$(\mu dp)$, $(\mu\subseteq)$, $(\mu PR)$. If, in addition, (CUM) holds, then $(\mu CUM)$ will hold, too. If, in addition, ($\mathcal{A}$-min) holds, then 
$(\mu\mathcal{A})$ will hold, too. 

Proof 

All properties except ($\mathcal{A}$-min) and $(\mu\mathcal{A})$ are shown in Proposition 2.3.4. But the remaining two are trivial. □ 



5.3 Two sequent calculi 

5.3.1 Introduction 

This section serves mainly as a posteriori motivation for our examination of weak closure conditions of the domain. The 
second author realized first when looking at Lehmann's plausibility logic, that absence of $(\cup)$ might be a problem for 
representation - see [Sch96-3] or [Sch04]. 

Beyond motivation, the reader will see here two "real life" examples where closure under $(\cup)$ is not given, and thus problems 
arise. So this is also a warning against a too naive treatment of representation problems, neglecting domain closure issues. 

5.3.2 Plausibility Logic 

5.3.2.0.1 Discussion of plausibility logic 

Plausibility logic was introduced by D. Lehmann [Leh92a], [Leh92b] as a sequent calculus in a propositional language 
without connectives. Thus, a plausibility logic language $\mathcal{L}$ is just a set, whose elements correspond to propositional 
variables, and a sequent has the form $X \mid\!\sim Y$, where $X, Y$ are finite subsets of $\mathcal{L}$, thus, in the intuitive reading, 
$\bigwedge X \mid\!\sim \bigvee Y$. (We use $\mid\!\sim$ instead of the $\vdash$ used in [Leh92a], [Leh92b] and continue to reserve $\vdash$ for classical logic.) 

5.3.2.0.2 The details: 
Notation 5.3.1 

We abuse notation, and write $X \mid\!\sim a$ for $X \mid\!\sim \{a\}$, $X, a \mid\!\sim Y$ for $X \cup \{a\} \mid\!\sim Y$, $ab \mid\!\sim Y$ for $\{a, b\} \mid\!\sim Y$, etc. When 
discussing plausibility logic, $X, Y$, etc. will denote finite subsets of $\mathcal{L}$, $a, b$, etc. elements of $\mathcal{L}$. 

We first define the logical properties we will examine. 
Definition 5.3.1 

$X$ and $Y$ will be finite subsets of $\mathcal{L}$, $a$, etc. elements of $\mathcal{L}$. The base axiom and rules of plausibility logic are (we use the 
prefix "Pl" to differentiate them from the usual ones): 

(PlI) (Inclusion): $X \mid\!\sim a$ for all $a \in X$, 

(PlRM) (Right Monotony): $X \mid\!\sim Y \Rightarrow X \mid\!\sim a, Y$, 

(PlCLM) (Cautious Left Monotony): $X \mid\!\sim a$, $X \mid\!\sim Y \Rightarrow X, a \mid\!\sim Y$, 
and as a special case of (PlCC): 

(PlUCC) (Unit Cautious Cut): $X, a \mid\!\sim Y$, $X \mid\!\sim a, Y \Rightarrow X \mid\!\sim Y$. 

and we denote by PL, for plausibility logic, the full system, i.e. (PlI) + (PlRM) + (PlCLM) + (PlCC). □ 

We now adapt the definition of a preferential model to plausibility logic. This is the central definition on the semantic side. 
Definition 5.3.2 

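Fix a plausibility logic language $\mathcal{L}$. A model for $\mathcal{L}$ is then just an arbitrary subset of $\mathcal{L}$. 

If $\mathcal{M} := \langle M, \prec\rangle$ is a preferential model s.t. $M$ is a set of (indexed) $\mathcal{L}$-models, then for a finite set $X \subseteq \mathcal{L}$ (to be imagined 
on the left hand side of $\mid\!\sim$!), we define 

(a) $m \models X$ iff $X \subseteq m$ 

(b) $M(X) := \{m : \langle m, i\rangle \in M$ for some $i$ and $m \models X\}$ 

(c) $\mu(X) := \{m \in M(X) : \exists\langle m, i\rangle \in M . \neg\exists\langle m', i'\rangle \in M\ (m' \in M(X) \wedge \langle m', i'\rangle \prec \langle m, i\rangle)\}$ 

(d) $X \models_{\mathcal{M}} Y$ iff $\forall m \in \mu(X) . m \cap Y \neq \emptyset$. □ 

(a) reflects the intuitive reading of $X$ as $\bigwedge X$, and (d) that of $Y$ as $\bigvee Y$ in $X \mid\!\sim Y$. Note that $X$ is a set of "formulas", and 
$\mu(X) = \mu_{\mathcal{M}}(M(X))$. 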
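
Clauses (a) to (d) translate almost literally into code for a finite structure. The following Python sketch is an illustration only: the two-variable language, the two models, and the function names (`models_of`, `mu`, `entails`) are assumptions made for the example, with a model represented as the frozenset of variables it makes true.

```python
def models_of(X, structure):
    """M(X): models m occurring in the structure with X a subset of m."""
    return {m for (m, _i) in structure if X <= m}


def mu(X, structure, smaller):
    """Models in M(X) having a copy with no smaller copy inside M(X)."""
    candidates = [(m, i) for (m, i) in structure if X <= m]
    return {m for (m, i) in candidates
            if not any((other, (m, i)) in smaller for other in candidates)}


def entails(X, Y, structure, smaller):
    """X |=_M Y iff every model in mu(X) contains some element of Y."""
    return all(m & Y for m in mu(X, structure, smaller))


# Hypothetical language {a, b}; each model occurs in a single copy (index 0).
m_ab, m_a = frozenset("ab"), frozenset("a")
structure = {(m_ab, 0), (m_a, 0)}
smaller = {((m_ab, 0), (m_a, 0))}   # the model {a, b} is preferred
```

With the preference in place, $a \models_{\mathcal{M}} b$ holds in this toy structure; with the empty relation it fails, while $a \models_{\mathcal{M}} a$ holds in both cases, as (PlI) requires.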

We note as trivial consequences of the definition. 
Fact 5.3.1 

(a) $a \models_{\mathcal{M}} b$ iff for all $m \in \mu(a)$, $b \in m$ 

(b) $X \models_{\mathcal{M}} Y$ iff $\mu(X) \subseteq \bigcup\{M(b) : b \in Y\}$ 

(c) $m \in \mu(X) \wedge X \subseteq X' \wedge m \in M(X') \Rightarrow m \in \mu(X')$. □ 

We note without proof: (PlI) + (PlRM) + (PlCC) is complete (and sound) for preferential models - see [Sch96-3] or 
[Sch04] for a proof. 

We note the following fact for smooth preferential models: 
Fact 5.3.2 

Let $U, X, Y$ be any sets, $\mathcal{M}$ be smooth for at least $\{Y, X\}$ and let $\mu(Y) \subseteq U \cup X$, $\mu(X) \subseteq U$, then $X \cap Y \cap \mu(U) \subseteq \mu(Y)$. 
(This is, of course, a special case of $(\mu Cum1)$.) 

Example 5.3.1 

Let $\mathcal{L} := \{a, b, c, d, e, f\}$, and $\mathcal{X} := \{a \mid\!\sim b,\ b \mid\!\sim a,\ a \mid\!\sim c,\ a \mid\!\sim fd,\ dc \mid\!\sim ba,\ dc \mid\!\sim e,\ fcba \mid\!\sim e\}$. We show that $\mathcal{X}$ does not 
have a smooth representation. 

Fact 5.3.3 

$\mathcal{X}$ does not entail $a \mid\!\sim e$. 

See [Sch96-3] or [Sch04] for a proof. 

Suppose now that there is a smooth preferential model $\mathcal{M} = \langle M, \prec\rangle$ for plausibility logic which represents $\mid\!\sim$, i.e. for all 
$X, Y$ finite subsets of $\mathcal{L}$, $X \mid\!\sim Y$ iff $X \models_{\mathcal{M}} Y$. (See Definition 5.3.2 and Fact 5.3.1 (page 92).) 

$a \mid\!\sim a$, $a \mid\!\sim b$, $a \mid\!\sim c$ implies for $m \in \mu(a)$, $a, b, c \in m$. Moreover, as $a \mid\!\sim df$, then also $d \in m$ or $f \in m$. As $a \not\mid\!\sim e$, there 
must be $m \in \mu(a)$ s.t. $e \notin m$. Suppose now $m \in \mu(a)$ with $f \in m$. So $a, b, c, f \in m$, thus by $m \in \mu(a)$ and Fact 5.3.1 (page 
92), $m \in \mu(a, b, c, f)$. But $fcba \mid\!\sim e$, so $e \in m$. We thus have shown that $m \in \mu(a)$ and $f \in m$ implies $e \in m$. Consequently, 
there must be $m \in \mu(a)$ s.t. $d \in m$, $e \notin m$. Thus, in particular, as $cd \mid\!\sim e$, there is $m \in \mu(a)$, $a, b, c, d \in m$, $m \notin \mu(cd)$. 
But by $cd \mid\!\sim ab$, and $b \mid\!\sim a$, $\mu(cd) \subseteq M(a) \cup M(b)$ and $\mu(b) \subseteq M(a)$ by Fact 5.3.1 (page 92). Let now $T := M(cd)$, 
$R := M(a)$, $S := M(b)$, and $\mu_{\mathcal{M}}$ be the choice function of the minimal elements in the structure $\mathcal{M}$, we then have by 
$\mu(X) = \mu_{\mathcal{M}}(M(X))$: 

1. $\mu_{\mathcal{M}}(T) \subseteq R \cup S$, 

2. $\mu_{\mathcal{M}}(S) \subseteq R$, 

3. there is $m \in S \cap T \cap \mu_{\mathcal{M}}(R)$, but $m \notin \mu_{\mathcal{M}}(T)$, 
but this contradicts above Fact 5.3.2 (page 92). 
□ (Counterexample D - 6.2.1) 
5.3.3 A comment on the work by Arieli and Avron 

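We turn to a similar case, published in [AA00]. Definitions are due to [AA00], for motivation the reader is referred there. 
Definition 5.3.3 

(1) A Scott consequence relation, abbreviated scr, is a binary relation $\vdash$ between sets of formulae, that satisfies the following 
conditions: 

(s-R) if $\Gamma \cap \Delta \neq \emptyset$, then $\Gamma \vdash \Delta$, 

(M) if $\Gamma \vdash \Delta$ and $\Gamma \subseteq \Gamma'$, $\Delta \subseteq \Delta'$, then $\Gamma' \vdash \Delta'$, 
(C) if $\Gamma \vdash \psi, \Delta$ and $\Gamma', \psi \vdash \Delta'$, then $\Gamma, \Gamma' \vdash \Delta, \Delta'$. 

(2) A Scott cautious consequence relation, abbreviated sccr, is a binary relation $\mid\!\sim$ between nonempty sets of formulae, 
that satisfies the following conditions: 

(s-R) if $\Gamma \cap \Delta \neq \emptyset$, then $\Gamma \mid\!\sim \Delta$, 

(CM) if $\Gamma \mid\!\sim \Delta$ and $\Gamma \mid\!\sim \psi$, then $\Gamma, \psi \mid\!\sim \Delta$, 
(CC) if $\Gamma \mid\!\sim \psi$ and $\Gamma, \psi \mid\!\sim \Delta$, then $\Gamma \mid\!\sim \Delta$. 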

Example 5.3.2 

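We have two consequence relations, $\vdash$ and $\mid\!\sim$. 

The rules to consider are 

$LCC^n$: $\Gamma \mid\!\sim \psi_1, \Delta$, ..., $\Gamma \mid\!\sim \psi_n, \Delta$, $\Gamma, \psi_1, \ldots, \psi_n \mid\!\sim \Delta$ $\Rightarrow$ $\Gamma \mid\!\sim \Delta$ 

$RW^n$: $\Gamma \mid\!\sim \psi_i, \Delta$ ($i = 1 \ldots n$), $\Gamma, \psi_1, \ldots, \psi_n \vdash \phi$ $\Rightarrow$ $\Gamma \mid\!\sim \phi, \Delta$ 

Cum: $\Gamma, \Delta \neq \emptyset$, $\Gamma \vdash \Delta$ $\Rightarrow$ $\Gamma \mid\!\sim \Delta$ 

RM: $\Gamma \mid\!\sim \Delta$ $\Rightarrow$ $\Gamma \mid\!\sim \psi, \Delta$ 

s-R: $\Gamma \cap \Delta \neq \emptyset$ $\Rightarrow$ $\Gamma \mid\!\sim \Delta$ 

M: $\Gamma \vdash \Delta$, $\Gamma \subseteq \Gamma'$, $\Delta \subseteq \Delta'$ $\Rightarrow$ $\Gamma' \vdash \Delta'$ 

C: $\Gamma_1 \vdash \psi, \Delta_1$, $\Gamma_2, \psi \vdash \Delta_2$ $\Rightarrow$ $\Gamma_1, \Gamma_2 \vdash \Delta_1, \Delta_2$ 

Let $\mathcal{L}$ be any set. Define now $\Gamma \vdash \Delta$ iff $\Gamma \cap \Delta \neq \emptyset$. Then s-R and M for $\vdash$ are trivial. For C: If $\Gamma_1 \cap \Delta_1 \neq \emptyset$ or 
$\Gamma_2 \cap \Delta_2 \neq \emptyset$, the result is trivial. If not, $\psi \in \Gamma_1$ and $\psi \in \Delta_2$, which implies the result. So $\vdash$ is a scr. 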
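
The claim that this $\vdash$ is a scr can also be confirmed by exhaustive search over a small language. The Python sketch below is an illustration under stated assumptions: the three-element base set and all function names (`subsets`, `proves`, `check_s_r`, `check_m`, `check_c`) are hypothetical, and a finite check of course does not replace the general argument.

```python
from itertools import combinations


def subsets(base):
    """All subsets of `base` as frozensets."""
    items = sorted(base)
    return [frozenset(c) for r in range(len(items) + 1)
            for c in combinations(items, r)]


def proves(gamma, delta):
    """The relation of the example: Gamma |- Delta iff they share an element."""
    return bool(gamma & delta)


def check_s_r(sets):
    """(s-R): a common element gives Gamma |- Delta."""
    return all(proves(g, d) for g in sets for d in sets if g & d)


def check_m(sets):
    """(M): |- is monotone on both sides."""
    return all(proves(g2, d2)
               for g in sets for d in sets if proves(g, d)
               for g2 in sets if g <= g2
               for d2 in sets if d <= d2)


def check_c(base, sets):
    """(C): cut on a single formula psi."""
    return all(proves(g1 | g2, d1 | d2)
               for psi in base
               for g1 in sets for d1 in sets if proves(g1, d1 | {psi})
               for g2 in sets for d2 in sets if proves(g2 | {psi}, d2))


base = frozenset("pqr")   # a hypothetical three-element language
sets = subsets(base)
```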

Consider now the rules for a sccr which is $\vdash$-plausible for this $\vdash$. Cum is equivalent to s-R, which is essentially (PlI) 
of Plausibility Logic. Consider $RW^n$. If $\phi$ is one of the $\psi_i$, then the consequence $\Gamma \mid\!\sim \phi, \Delta$ is a case of one of the other 
hypotheses. If not, $\phi \in \Gamma$, so $\Gamma \mid\!\sim \phi$ by s-R, so $\Gamma \mid\!\sim \phi, \Delta$ by RM (if $\Delta$ is finite). So, for this $\vdash$, $RW^n$ is a consequence of 
s-R + RM. 

We are left with $LCC^n$, RM, CM, s-R; it was shown in [Sch04] and [Sch96-3] that this does not suffice to guarantee 
smooth representability, by failure of $(\mu Cum1)$. 

5.4 Blurred observation - absence of definability preservation 
5.4.1 Introduction 

Lack of definability preservation results in uncertainty. We do not know exactly the result, but only that it is not too far 
away from what we (seem to) observe. 

Thus, we pretend to know more than we know, and, according to our general policy of not neglecting ignorance, we should 
branch here into a multitude of solutions. 

We take here a seemingly different way, but, as a matter of fact, we just describe the boundaries of what is permitted. So, 
everything which lies in those boundaries, is a possible solution, and every such solution should be considered as equal, 
and, again, we should not pretend to know more than we actually do. 

5.4.1.1 General remarks, affected conditions 



We assume now - unless explicitly stated otherwise - $\mathcal{Y} \subseteq \mathcal{P}(Z)$ to be closed under arbitrary intersections (this is used for the definition of $\overbrace{\ \cdot\ }$) and finite unions, and $\emptyset, Z \in \mathcal{Y}$. This holds, of course, for $\mathcal{Y} = D_{\mathcal{L}}$, $\mathcal{L}$ any propositional language.

The aim of Section 5.4 (page 93) is to present the results of [Sch04] connected to problems of definability preservation in a uniform way, stressing the crucial condition $\overbrace{X \cap Y} = \overbrace{X} \cap \overbrace{Y}$. This presentation shall help and guide future research concerning similar problems.

For motivation, we first consider the problem with definability preservation for the rules

(PR) $\overline{\overline{T \cup T'}} \subseteq \overline{\overline{\overline{T}} \cup T'}$, and

$(\mathrel{|\!\sim}=)$ $T \vdash T'$, $Con(\overline{\overline{T'}}, T)$ $\Rightarrow$ $\overline{\overline{T}} = \overline{\overline{\overline{T'}} \cup T}$ holds,

which are consequences of

$(\mu PR)$ $X \subseteq Y \Rightarrow \mu(Y) \cap X \subseteq \mu(X)$ or

$(\mu =)$ $X \subseteq Y$, $\mu(Y) \cap X \neq \emptyset$ $\Rightarrow$ $\mu(Y) \cap X = \mu(X)$ respectively

and definability preservation.

Example 4.2.1 showed that in the general case without definability preservation, (PR) fails, and the following Example 5.4.1 (page 94) shows that in the ranked case, $(\mathrel{|\!\sim}=)$ may fail. So failure is not just a consequence of the very liberal definition of general preferential structures.

Example 5.4.1 

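Take $\{p_i : i \in \omega\}$ and put $m := m_{\bigwedge p_i}$, the model which makes all $p_i$ true, in the top layer, all the other in the bottom layer. Let $m' \neq m$, $T' := \emptyset$, $T := Th(m, m')$. Then $\overline{\overline{T'}} = \overline{T'}$, so $Con(\overline{\overline{T'}}, T)$, $\overline{\overline{T}} = Th(m')$, $\overline{\overline{\overline{T'}} \cup T} = \overline{T}$.
□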



We remind the reader of Definition 2.1.5 and Fact 2.1.1, partly taken from [Sch04].
We turn to the central condition. 

5.4.1.2 The central condition 

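We analyze the problem of (PR), seen in Example 5.4.1 (page 94) (1) above, working in the intended application.

(PR) is equivalent to $M(\overline{\overline{T}} \cup T') \subseteq M(\overline{\overline{T \cup T'}})$. To show (PR) from $(\mu PR)$, we argue as follows, the crucial point is marked by "?":

$M(\overline{\overline{T \cup T'}}) = M(Th(\mu(M_{T \cup T'}))) = \overbrace{\mu(M_{T \cup T'})} \supseteq \mu(M_{T \cup T'}) = \mu(M_T \cap M_{T'}) \supseteq$ (by $(\mu PR)$) $\mu(M_T) \cap M_{T'}$ $?\!\supseteq$ $\overbrace{\mu(M_T)} \cap M_{T'} = M(Th(\mu(M_T))) \cap M_{T'} = M(\overline{\overline{T}}) \cap M_{T'} = M(\overline{\overline{T}} \cup T')$.

If $\mu$ is definability preserving, then $\mu(M_T) = \overbrace{\mu(M_T)}$, so "?" above is equality, and everything is fine. In general, however, we have only $\mu(M_T) \subseteq \overbrace{\mu(M_T)}$, and the argument collapses.

But it is not necessary to impose $\mu(M_T) = \overbrace{\mu(M_T)}$, as we still have room to move: $\overbrace{\mu(M_{T \cup T'})} \supseteq \mu(M_{T \cup T'})$. (We do not consider here $\mu(M_T \cap M_{T'}) \supseteq \mu(M_T) \cap M_{T'}$ as room to move, as we are now interested only in questions related to definability preservation.) If we had $\overbrace{\mu(M_T)} \cap M_{T'} \subseteq \overbrace{\mu(M_T) \cap M_{T'}}$, we could use $\mu(M_T) \cap M_{T'} \subseteq \mu(M_T \cap M_{T'}) = \mu(M_{T \cup T'})$ and monotony of $\overbrace{\ \cdot\ }$ to obtain $\overbrace{\mu(M_T)} \cap M_{T'} \subseteq \overbrace{\mu(M_T) \cap M_{T'}} \subseteq \overbrace{\mu(M_T \cap M_{T'})} = \overbrace{\mu(M_{T \cup T'})}$. If, for instance, $T' = \{\psi\}$, we have $\overbrace{\mu(M_T)} \cap M_{T'} = \overbrace{\mu(M_T) \cap M_{T'}}$ by Fact 2.1.1 $(Cl\cap+)$. Thus, definability preservation is not the only solution to the problem.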

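We have seen in Fact 2.1.1 (page 25) that $\overbrace{X \cup Y} = \overbrace{X} \cup \overbrace{Y}$, moreover $X - Y = X \cap CY$ ($CY$ the set complement of $Y$), so, when considering boolean expressions of model sets (as we do in usual properties describing logics), the central question is whether

$(\sim\cap)$ $\overbrace{X \cap Y} = \overbrace{X} \cap \overbrace{Y}$

holds.

We take a closer look at this question.

$\overbrace{X \cap Y} \subseteq \overbrace{X} \cap \overbrace{Y}$ holds by Fact 2.1.1 (6). Using $(Cl\cup)$ and monotony of $\overbrace{\ \cdot\ }$, we have

$\overbrace{X} \cap \overbrace{Y} = \overbrace{(X \cap Y) \cup (X - Y)} \cap \overbrace{(X \cap Y) \cup (Y - X)} = (\overbrace{X \cap Y} \cup \overbrace{X - Y}) \cap (\overbrace{X \cap Y} \cup \overbrace{Y - X}) = \overbrace{X \cap Y} \cup (\overbrace{X - Y} \cap \overbrace{Y - X})$,

thus $\overbrace{X} \cap \overbrace{Y} \subseteq \overbrace{X \cap Y}$ iff

$(\sim\cap')$ $\overbrace{Y - X} \cap \overbrace{X - Y} \subseteq \overbrace{X \cap Y}$

holds.

Intuitively speaking, the condition holds iff we cannot approximate any element both from $X - Y$ and $Y - X$, which cannot be approximated from $X \cap Y$, too.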
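A small finite computation can make the possible failure of $(\sim\cap)$ concrete. The sketch below assumes that the closure of $X$ is the intersection of all members of $\mathcal{Y}$ that contain $X$ (the usual reading of the closure operator, taken here as an assumption since its definition lies outside this section). The three-element base set, the family `DOMAIN`, and the function name `closure` are hypothetical illustrations only.

```python
def closure(X, domain):
    """Smallest member of `domain` containing X.

    Assumes `domain` contains the whole base set and is closed under
    intersection, so the running intersection is again in `domain`.
    """
    result = None
    for member in domain:
        if X <= member:
            result = member if result is None else result & member
    return result


# Hypothetical base set and family Y, closed under unions and
# intersections, containing the empty set and the whole set.
Z = frozenset({1, 2, 3})
DOMAIN = [frozenset(), frozenset({3}), Z]

X = frozenset({1, 3})
Y = frozenset({2, 3})

lhs = closure(X & Y, DOMAIN)                   # closure of the intersection
rhs = closure(X, DOMAIN) & closure(Y, DOMAIN)  # intersection of the closures
both_differences = closure(X - Y, DOMAIN) & closure(Y - X, DOMAIN)
```

Here `lhs` is `{3}` while `rhs` is the whole set `Z`, so the closure of the intersection is strictly smaller than the intersection of the closures. Elements 1 and 2 lie in both `closure(X - Y)` and `closure(Y - X)` but not in `closure(X & Y)`, which is exactly the situation excluded by the primed condition.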

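Note that in above Example 5.4.1 (page 94) (1) $X := \mu(M_T) = M_{\mathcal{L}} - \{n'\}$, $Y := M_{T'} = \{n, n'\}$, $\overbrace{X - Y} = M_{\mathcal{L}}$, $\overbrace{Y - X} = \{n'\}$, $\overbrace{X \cap Y} = \{n\}$, and $\overbrace{X} \cap \overbrace{Y} = \{n, n'\}$.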

We consider now particular cases: 
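(2) If $X \in \mathcal{Y}$ and $Y \in \mathcal{Y}$, then $\overbrace{X - Y} \subseteq X$ and $\overbrace{Y - X} \subseteq Y$, so $\overbrace{X - Y} \cap \overbrace{Y - X} \subseteq X \cap Y \subseteq \overbrace{X \cap Y}$ and $(\sim\cap)$ trivially holds.

(3) $X \in \mathcal{Y}$ and $CX \in \mathcal{Y}$ together also suffice - in these cases $\overbrace{Y - X} \cap \overbrace{X - Y} = \emptyset$: $\overbrace{Y - X} = \overbrace{Y \cap CX} \subseteq CX$, and $\overbrace{X - Y} \subseteq X$, so $\overbrace{Y - X} \cap \overbrace{X - Y} \subseteq X \cap CX = \emptyset \subseteq \overbrace{X \cap Y}$. (The same holds, of course, for $Y$.) (In the intended application, such $X$ will be $M(\phi)$ for some formula $\phi$. But, a warning, $\mu(M(\phi))$ need not again be the $M(\psi)$ for some $\psi$.)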
We turn to the properties of various structures and apply our results. 

5.4.1.3 Application to various structures 

We now take a look at other frequently used logical conditions. First, in the context of nonmonotonic logics, the following rules will always hold in smooth preferential structures, even if we consider full theories, and not necessarily definability preserving structures:

Fact 5.4.1

Also for full theories, and not necessarily definability preserving structures hold:

(1) (LLE), (RW), (AND), (REF), by definition and $(\mu \subseteq)$,

(2) (OR),

(3) (CM) in smooth structures,

(4) the infinitary version of (CUM) in smooth structures.

In definability preserving structures, but also when considering only formulas hold:

(5) (PR),

(6) $(\mathrel{|\!\sim}=)$ in ranked structures.
Proof 

We use the corresponding algebraic properties. The result then follows from Proposition ^. 3. 4l (pagel36 |) . □ 

We turn to theory revision. The following definition and example, taken from [Sch04], show that the usual AGM axioms for theory revision fail in distance based structures in the general case, unless we require definability preservation. See Chapter 8.2 (page 158) for discussion and motivation.

Definition 5.4.1 

We summarize the AGM postulates $(K * 7)$ and $(K * 8)$ in $(*4)$:

$(*4)$ If $T * T'$ is consistent with $T''$, then $T * (T' \cup T'') = \overline{(T * T') \cup T''}$.

Example 5.4.2

Consider an infinite propositional language $\mathcal{L}$.

Let $X$ be an infinite set of models, $m$, $m_1$, $m_2$ be models for $\mathcal{L}$. Arrange the models of $\mathcal{L}$ in the real plane s.t. all $x \in X$ have the same distance $< 2$ (in the real plane) from $m$, $m_2$ has distance 2 from $m$, and $m_1$ has distance 3 from $m$.

Let $T$, $T_1$, $T_2$ be complete (consistent) theories, $T'$ a theory with infinitely many models, $M(T) = \{m\}$, $M(T_1) = \{m_1\}$, $M(T_2) = \{m_2\}$. $M(T') = X \cup \{m_1, m_2\}$, $M(T'') = \{m_1, m_2\}$.

Assume $Th(X) = T'$, so $X$ will not be definable by a theory.

Then $M(T) \mid M(T') = X$, but $T * T' = Th(X) = T'$. So $T * T'$ is consistent with $T''$, and $\overline{(T * T') \cup T''} = T''$. But $T' \cup T'' = T''$, and $T * (T' \cup T'') = T_2 \neq T''$, contradicting $(*4)$.

□ 



We show now that the version with formulas only holds here, too, just as does above (PR), when we consider formulas only - this is needed below for $T''$ only. This was already shown in [Sch04], we give now a proof based on our new principles.

Fact 5.4.2 

(*4) holds when considering only formulas. 
Proof 

When we fix the left hand side, the structure is ranked, so $Con(T * T', T'')$ implies $(M_T \mid M_{T'}) \cap M_{T''} \neq \emptyset$ by $T'' = \{\psi\}$ and thus $M_T \mid M_{T' \cup T''} = M_T \mid (M_{T'} \cap M_{T''}) = (M_T \mid M_{T'}) \cap M_{T''}$. So $M(T * (T' \cup T'')) = \overbrace{M_T \mid M_{T' \cup T''}} = \overbrace{(M_T \mid M_{T'}) \cap M_{T''}} =$
5.4.2 General and smooth structures without definability preservation
5.4.2.1 Introduction 

Note that in Sections 3.2 and 3.3 of [Sch04], as well as in Proposition 4.2.2 of [Sch04], we have characterized $\mu : \mathcal{Y} \to \mathcal{Y}$ or $\mid : \mathcal{Y} \times \mathcal{Y} \to \mathcal{Y}$, but a closer inspection of the proofs shows that the destination can as well be assumed $\mathcal{P}(Z)$, consequently we can simply re-use above algebraic representation results also for the not definability preserving case. (Note that the easy direction of all these results works for destination $\mathcal{P}(Z)$, too.) In particular, also the proof for the not definability preserving case of revision in [Sch04] can be simplified - but we will not go into details here.

$(\cup)$ and $(\bigcap)$ are again assumed to hold now - we need $(\bigcap)$ for $\overbrace{\ \cdot\ }$.

The central functions and conditions to consider are summarized in the following definition. 
Definition 5.4.2 

Let $\mu : \mathcal{Y} \to \mathcal{Y}$, we define $\mu_i : \mathcal{Y} \to \mathcal{P}(Z)$:

$\mu_0(U) := \{x \in U : \neg\exists Y \in \mathcal{Y}\,(Y \subseteq U$ and $x \in Y - \mu(Y))\}$,

$\mu_1(U) := \{x \in U : \neg\exists Y \in \mathcal{Y}\,(\mu(Y) \subseteq U$ and $x \in Y - \mu(Y))\}$,

$\mu_2(U) := \{x \in U : \neg\exists Y \in \mathcal{Y}\,(\mu(U \cup Y) \subseteq U$ and $x \in Y - \mu(Y))\}$

(note that we use $(\cup)$ here),

$\mu_3(U) := \{x \in U : \forall y \in U.\ x \in \mu(\{x, y\})\}$

(we use here $(\cup)$ and that singletons are in $\mathcal{Y}$).

"Small" is now in the sense of Definition 2.1.5.

$(\mu PR0)$ $\mu(U) - \mu_0(U)$ is small,

$(\mu PR1)$ $\mu(U) - \mu_1(U)$ is small,

$(\mu PR2)$ $\mu(U) - \mu_2(U)$ is small,

$(\mu PR3)$ $\mu(U) - \mu_3(U)$ is small.

$(\mu PR0)$ with its function $\mu_0$ will be the one to consider for general preferential structures, $(\mu PR2)$ the one for smooth structures.
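On a finite base set the four derived functions $\mu_0, \ldots, \mu_3$ can be computed directly from their defining formulas. The sketch below is an illustration under assumptions not made in the text: $\mathcal{Y}$ is taken to be the full power set of a four-element set, `mu_ranked` is a hypothetical ranked choice function, and `mu_blurred` is a made-up function that returns too large a set on big arguments, standing in for an observation that hides part of the minimization.

```python
from itertools import combinations


def powerset(universe):
    """All subsets of `universe` as frozensets (plays the role of Y)."""
    items = sorted(universe)
    return [frozenset(c) for r in range(len(items) + 1)
            for c in combinations(items, r)]


def mu_0(mu, domain, U):
    """x stays unless some Y in domain, Y subset of U, has x in Y - mu(Y)."""
    return frozenset(x for x in U
                     if not any(Y <= U and x in Y - mu(Y) for Y in domain))


def mu_1(mu, domain, U):
    """As mu_0, but the witness Y only needs mu(Y) subset of U."""
    return frozenset(x for x in U
                     if not any(mu(Y) <= U and x in Y - mu(Y) for Y in domain))


def mu_2(mu, domain, U):
    """As mu_0, but the witness Y needs mu(U union Y) subset of U."""
    return frozenset(x for x in U
                     if not any(mu(U | Y) <= U and x in Y - mu(Y) for Y in domain))


def mu_3(mu, U):
    """x stays iff x is chosen in every pair {x, y} with y in U."""
    return frozenset(x for x in U
                     if all(x in mu(frozenset({x, y})) for y in U))


# Hypothetical ranking: 0 and 1 are on the lowest level.
RANK = {0: 0, 1: 0, 2: 1, 3: 2}
Z = frozenset(RANK)
DOMAIN = powerset(Z)


def mu_ranked(X):
    """Elements of minimal rank in X."""
    if not X:
        return frozenset()
    lowest = min(RANK[x] for x in X)
    return frozenset(x for x in X if RANK[x] == lowest)


def mu_blurred(X):
    """Toy 'blurred' choice: exact on small sets, everything on larger ones."""
    return X if len(X) >= 3 else mu_ranked(X)
```

For `mu_ranked` all four derived functions agree with the function itself on every set of the domain. For `mu_blurred`, `mu_0(mu_blurred, DOMAIN, Z)` and `mu_3(mu_blurred, Z)` both return `{0, 1}`, although `mu_blurred(Z)` is all of `Z`: the small witness sets expose the elements that the larger set hides.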



5.4.2.1.1 A non-trivial problem 

Unfortunately, we cannot use $(\mu PR0)$ in the smooth case, too, as Example 5.4.4 below will show. This sheds some doubt on the possibility to find an easy common approach to all cases of not definability preserving preferential, and perhaps other, structures. The next best guess, $(\mu PR1)$, will not work either, as the same example shows - or by Fact 5.4.3 (page 97) (10), if $\mu$ satisfies $(\mu Cum)$, then $\mu_0(U) = \mu_1(U)$. $(\mu PR3)$ and $\mu_3$ are used for ranked structures.

We will now see that this first impression of a difficult situation is indeed well founded.

First, note that in our context, $\mu$ will not necessarily respect $(\mu PR)$. Thus, if e.g. $x \in Y - \mu(Y)$, and $\mu(Y) \subseteq U$, we cannot necessarily conclude that $x \notin \mu(U \cup Y)$ - the fact that $x$ is minimized in $U \cup Y$ might be hidden by the bigger $\mu(U \cup Y)$.

Consequently, we may have to work with small sets ($Y$ in the case of $\mu_2$ above) to see the problematic elements - recall that the smaller the set $\mu(X)$ is, the less it can "hide" missing elements - but will need bigger sets ($U \cup Y$ in above example) to recognize the contradiction.

Second, "problematic" elements are those involved in a contradiction, i.e. contradicting the representation conditions. 
Now, a negation of a conjunction is a disjunction of negations, so, generally, we will have to look at various possibilities of 
violated conditions. But the general situation is much worse, still. 

Example 5.4.3 

Look at the ranked case, and assume no closure properties of the domain. Recall that we might be unable to see $\mu(X)$, but see only $\overbrace{\mu(X)}$. Suppose we have $\overbrace{\mu(X_1)} \cap (X_2 - \overbrace{\mu(X_2)}) \neq \emptyset$, $\overbrace{\mu(X_2)} \cap (X_3 - \overbrace{\mu(X_3)}) \neq \emptyset$, ..., $\overbrace{\mu(X_{n-1})} \cap (X_n - \overbrace{\mu(X_n)}) \neq \emptyset$, $\overbrace{\mu(X_n)} \cap (X_1 - \overbrace{\mu(X_1)}) \neq \emptyset$, which seems to be a contradiction. (It only is a real contradiction if it still holds without the closures.) But, we do not know where the contradiction is situated. It might well be that for all but one $i$ really $\mu(X_i) \cap (X_{i+1} - \mu(X_{i+1})) \neq \emptyset$, and not only that for the closure $\overbrace{\mu(X_i)}$ of $\mu(X_i)$ $\overbrace{\mu(X_i)} \cap (X_{i+1} - \overbrace{\mu(X_{i+1})}) \neq \emptyset$, but we might be unable to find this out. So we have to branch into all possibilities, i.e. for one, or several $i$, $\overbrace{\mu(X_i)} \cap (X_{i+1} - \overbrace{\mu(X_{i+1})}) \neq \emptyset$, but $\mu(X_i) \cap (X_{i+1} - \mu(X_{i+1})) = \emptyset$.

□ 
The situation might even be worse, when those $\overbrace{\mu(X_i)} \cap (X_{i+1} - \overbrace{\mu(X_{i+1})}) \neq \emptyset$ are involved in several cycles, etc. Consequently, it seems very difficult to describe all possible violations in one concise condition, and thus we will examine here only some specific cases, and do not pretend that they are the only ones, that other cases are similar, or that our solutions (which depend on closure conditions) are the best ones.

5.4.2.1.2 Outline of our solutions in some particular cases 

The strategy of representation without definability preservation will in all cases be very simple: Under sufficient conditions, among them smallness $(\mu PRi)$ as described above, the corresponding function $\mu_i$ has all the properties to guarantee representation by a corresponding structure, and we can just take our representation theorems for the dp case, to show this. Using smallness again, we can show that we have obtained a sufficient approximation - see Proposition 5.4.5, Proposition 5.4.6, Proposition 5.4.9.

We first show some properties for the $\mu_i$, $i = 0, 1, 2$. A corresponding result for $\mu_3$ is given in Fact 5.4.7 (page 99) below. (The conditions and results are sufficiently different for $\mu_3$ to make a separation more natural.)

Property (9) of the following Fact 5.4.3 fails for $\mu_0$ and $\mu_1$, as Example 5.4.4 below will show. We will therefore work in the smooth case with $\mu_2$.

5.4.2.2 Results 



Fact 5.4.3 

(This is partly Fact 5.2.6 in [Sch04].)

Recall that $\mathcal{Y}$ is closed under $(\cup)$, and $\mu : \mathcal{Y} \to \mathcal{Y}$. Let $A, B, U, U', X, Y$ be elements of $\mathcal{Y}$ and the $\mu_i$ be defined from $\mu$ as in Definition 5.4.2, $i$ will here be 0, 1, or 2, but not 3.

(1) Let $\mu$ satisfy $(\mu \subseteq)$, then $\mu_1(X) \subseteq \mu_0(X)$ and $\mu_2(X) \subseteq \mu_0(X)$,

(2) Let $\mu$ satisfy $(\mu \subseteq)$ and $(\mu Cum)$, then $\mu(U \cup U') \subseteq U \Leftrightarrow \mu(U \cup U') = \mu(U)$,

(3) Let $\mu$ satisfy $(\mu \subseteq)$, then $\mu_i(U) \subseteq \mu(U)$, and $\mu_i(U) \subseteq U$,

(4) Let $\mu$ satisfy $(\mu \subseteq)$ and one of the $(\mu PRi)$, then $\mu(A \cup B) \subseteq \mu(A) \cup \mu(B)$,

(5) Let $\mu$ satisfy $(\mu \subseteq)$ and one of the $(\mu PRi)$, then $\mu_2(X) \subseteq \mu_1(X)$,

(6) Let $\mu$ satisfy $(\mu \subseteq)$, $(\mu PRi)$, then $\mu_i(U) \subseteq U' \Leftrightarrow \mu(U) \subseteq U'$,

(7) Let $\mu$ satisfy $(\mu \subseteq)$ and one of the $(\mu PRi)$, then $X \subseteq Y$, $\mu(X \cup U) \subseteq X$ $\Rightarrow$ $\mu(Y \cup U) \subseteq Y$,

(8) Let $\mu$ satisfy $(\mu \subseteq)$ and one of the $(\mu PRi)$, then $X \subseteq Y \Rightarrow X \cap \mu_i(Y) \subseteq \mu_i(X)$ - so $(\mu PR)$ holds for $\mu_i$ (more precisely, only for $\mu_2$ we need the prerequisites, in the other cases the definition suffices),

(9) Let $\mu$ satisfy $(\mu \subseteq)$, $(\mu PR2)$, $(\mu Cum)$, then $\mu_2(X) \subseteq Y \subseteq X \Rightarrow \mu_2(X) = \mu_2(Y)$ - so $(\mu Cum)$ holds for $\mu_2$.

(10) $(\mu \subseteq)$ and $(\mu Cum)$ for $\mu$ entail $\mu_0(U) = \mu_1(U)$.

Proof 

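(1) $\mu_1(X) \subseteq \mu_0(X)$ follows from $(\mu \subseteq)$ for $\mu$. For $\mu_2$: By $Y \subseteq U$, $U \cup Y = U$, so $\mu(U) \subseteq U$ by $(\mu \subseteq)$.

(2) $\mu(U \cup U') \subseteq U \subseteq U \cup U' \Rightarrow_{(\mu Cum)} \mu(U \cup U') = \mu(U)$.

(3) $\mu_i(U) \subseteq U$ by definition. To show $\mu_i(U) \subseteq \mu(U)$, take in all three cases $Y := U$, and use for $i = 1, 2$ $(\mu \subseteq)$.

(4) By definition of $\mu_0$, we have $\mu_0(A \cup B) \subseteq A \cup B$, $\mu_0(A \cup B) \cap (A - \mu(A)) = \emptyset$, $\mu_0(A \cup B) \cap (B - \mu(B)) = \emptyset$, so $\mu_0(A \cup B) \cap A \subseteq \mu(A)$, $\mu_0(A \cup B) \cap B \subseteq \mu(B)$, and $\mu_0(A \cup B) \subseteq \mu(A) \cup \mu(B)$. By $\mu : \mathcal{Y} \to \mathcal{Y}$ and $(\cup)$, $\mu(A) \cup \mu(B) \in \mathcal{Y}$. Moreover, by (3) $\mu_0(A \cup B) \subseteq \mu(A \cup B)$, so $\mu_0(A \cup B) \subseteq (\mu(A) \cup \mu(B)) \cap \mu(A \cup B)$, so by (1) $\mu_i(A \cup B) \subseteq (\mu(A) \cup \mu(B)) \cap \mu(A \cup B)$ for $i = 0, \ldots, 2$. If $\mu(A \cup B) \not\subseteq \mu(A) \cup \mu(B)$, then $(\mu(A) \cup \mu(B)) \cap \mu(A \cup B) \subset \mu(A \cup B)$, contradicting $(\mu PRi)$.

(5) Let $Y \in \mathcal{Y}$, $\mu(Y) \subseteq U$, $x \in Y - \mu(Y)$, then (by (4)) $\mu(U \cup Y) \subseteq \mu(U) \cup \mu(Y) \subseteq U$.

(6) "$\Leftarrow$" by (3). "$\Rightarrow$": By $(\mu PRi)$, $\mu(U) - \mu_i(U)$ is small, so there is no $X \in \mathcal{Y}$ s.t. $\mu_i(U) \subseteq X \subset \mu(U)$. If there were $U' \in \mathcal{Y}$ s.t. $\mu_i(U) \subseteq U'$, but $\mu(U) \not\subseteq U'$, then for $X := U' \cap \mu(U) \in \mathcal{Y}$, $\mu_i(U) \subseteq X \subset \mu(U)$, contradiction.

(7) $\mu(Y \cup U) = \mu(Y \cup X \cup U) \subseteq_{(4)} \mu(Y) \cup \mu(X \cup U) \subseteq Y \cup X = Y$.

(8) For $i = 0, 1$: Let $x \in X - \mu_0(X)$, then there is $A$ s.t. $A \subseteq X$, $x \in A - \mu(A)$, so $A \subseteq Y$. The case $i = 1$ is similar. We need here only the definitions. For $i = 2$: Let $x \in X - \mu_2(X)$, $A$ s.t. $x \in A - \mu(A)$, $\mu(X \cup A) \subseteq X$, then by (7) $\mu(Y \cup A) \subseteq Y$.

(9) "$\subseteq$": Let $x \in \mu_2(X)$, so $x \in Y$, and $x \in \mu_2(Y)$ by (8). "$\supseteq$": Let $x \in \mu_2(Y)$, so $x \in X$. Suppose $x \notin \mu_2(X)$, so there is $U \in \mathcal{Y}$ s.t. $x \in U - \mu(U)$ and $\mu(X \cup U) \subseteq X$. Note that by $\mu(X \cup U) \subseteq X$ and (2), $\mu(X \cup U) = \mu(X)$. Now, $\mu_2(X) \subseteq Y$, so by (6) $\mu(X) \subseteq Y$, thus $\mu(X \cup U) = \mu(X) \subseteq Y \subseteq Y \cup U \subseteq X \cup U$, so $\mu(Y \cup U) = \mu(X \cup U) = \mu(X) \subseteq Y$ by $(\mu Cum)$, so $x \notin \mu_2(Y)$, contradiction.

(10) $\mu_1(U) \subseteq \mu_0(U)$ by (1). Let $Y$ s.t. $\mu(Y) \subseteq U$, $x \in Y - \mu(Y)$, $x \in U$. Consider $Y \cap U$, $x \in Y \cap U$, $\mu(Y) \subseteq Y \cap U \subseteq Y$, so $\mu(Y) = \mu(Y \cap U)$ by $(\mu Cum)$, and $x \notin \mu(Y \cap U)$. Thus, $\mu_0(U) \subseteq \mu_1(U)$.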
Fact 5.4.4

In the presence of $(\mu \subseteq)$, $(\mu Cum)$ for $\mu$, we have: $(\mu PR0) \Leftrightarrow (\mu PR1)$, and $(\mu PR2) \Rightarrow (\mu PR1)$.
If $(\mu PR)$ also holds for $\mu$, then so will $(\mu PR1) \Rightarrow (\mu PR2)$.
(Recall that $(\cup)$ and $(\cap)$ are assumed to hold.)

Proof

$(\mu PR0) \Leftrightarrow (\mu PR1)$: By Fact 5.4.3 (page 97) (10), $\mu_0(U) = \mu_1(U)$ if $(\mu Cum)$ holds for $\mu$.

$(\mu PR2) \Rightarrow (\mu PR1)$: Suppose $(\mu PR2)$ holds. By $(\mu PR2)$ and (5), $\mu_2(U) \subseteq \mu_1(U)$, so $\mu(U) - \mu_1(U) \subseteq \mu(U) - \mu_2(U)$. By $(\mu PR2)$, $\mu(U) - \mu_2(U)$ is small, then so is $\mu(U) - \mu_1(U)$, so $(\mu PR1)$ holds.

$(\mu PR1) \Rightarrow (\mu PR2)$: Suppose $(\mu PR1)$ holds, and $(\mu PR2)$ fails. By failure of $(\mu PR2)$, there is $X \in \mathcal{Y}$ s.t. $\mu_2(U) \subseteq X \subset \mu(U)$. Let $x \in \mu(U) - X$, as $x \notin \mu_2(U)$, there is $Y$ s.t. $\mu(U \cup Y) \subseteq U$, $x \in Y - \mu(Y)$. Let $Z := U \cup Y \cup X$. By $(\mu PR)$, $x \notin \mu(U \cup Y)$, and $x \notin \mu(U \cup Y \cup X)$. Moreover, $\mu(U \cup X \cup Y) \subseteq \mu(U \cup Y) \cup \mu(X)$ by Fact 5.4.3 (page 97) (4), $\mu(U \cup Y) \subseteq U$, $\mu(X) \subseteq X \subseteq \mu(U) \subseteq U$ by prerequisite, so $\mu(U \cup X \cup Y) \subseteq U \subseteq U \cup Y \subseteq U \cup X \cup Y$, so $\mu(U \cup X \cup Y) = \mu(U \cup Y) \subseteq U$. Thus, $x \notin \mu_1(U)$, and $\mu_1(U) \subseteq X$, too, a contradiction.
□



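Here is an example which shows that Fact 5.4.3 (page 97) (9) may fail for $\mu_0$ and $\mu_1$.
Example 5.4.4

Consider $\mathcal{L}$ with $v(\mathcal{L}) := \{p_i : i \in \omega\}$. Let $m \not\models p_0$, let $m' \in M(p_0)$ arbitrary. Make for each $n \in M(p_0) - \{m'\}$ one copy of $m$, likewise of $m'$, set $(m, n) \prec (m', n)$ for all $n$, and $n \prec (m, n)$, $n \prec (m', n)$ for all $n$. The resulting structure $\mathcal{Z}$ is smooth and transitive. Let $\mathcal{Y} := D_{\mathcal{L}}$, define $\mu(X) := \overbrace{\mu_{\mathcal{Z}}(X)}$ for $X \in \mathcal{Y}$.

Let $m' \in X - \mu_{\mathcal{Z}}(X)$. Then $m \in X$, or $M(p_0) \subseteq X$. In the latter case, as all $m''$ s.t. $m'' \neq m'$, $m'' \models p_0$ are minimal, $M(p_0) - \{m'\} \subseteq \mu_{\mathcal{Z}}(X)$, so $m' \in \overbrace{\mu_{\mathcal{Z}}(X)} = \mu(X)$. Thus, as $\mu_{\mathcal{Z}}(X) \subseteq \mu(X)$, if $m' \in X - \mu(X)$, then $m \in X$.

Define now $X := M(p_0) \cup \{m\}$, $Y := M(p_0)$.

We first show that $\mu_0$ does not satisfy $(\mu Cum)$. $\mu_0(X) := \{x \in X : \neg\exists A \in \mathcal{Y}\,(A \subseteq X : x \in A - \mu(A))\}$. $m \notin \mu_0(X)$, as $m \notin \mu(X) = \overbrace{\mu_{\mathcal{Z}}(X)}$. Moreover, $m' \notin \mu_0(X)$, as $\{m, m'\} \in \mathcal{Y}$, $\{m, m'\} \subseteq X$, and $\mu(\{m, m'\}) = \mu_{\mathcal{Z}}(\{m, m'\}) = \{m\}$. So $\mu_0(X) \subseteq Y \subseteq X$. Consider now $\mu_0(Y)$. As $m \notin Y$, for any $A \in \mathcal{Y}$, $A \subseteq Y$, if $m' \in A$, then $m' \in \mu(A)$, too, by above argument, so $m' \in \mu_0(Y)$, and $\mu_0$ does not satisfy $(\mu Cum)$.

We turn to $\mu_1$.

By Fact 5.4.3 (1), $\mu_1(X) \subseteq \mu_0(X)$, so $m, m' \notin \mu_1(X)$, and again $\mu_1(X) \subseteq Y \subseteq X$. Consider again $\mu_1(Y)$. As $m \notin Y$, for any $A \in \mathcal{Y}$, $\mu(A) \subseteq Y$, if $m' \in A$, then $m' \in \mu(A)$, too: if $M(p_0) - \{m'\} \subseteq A$, then $m' \in \overbrace{\mu_{\mathcal{Z}}(A)}$; if $M(p_0) - \{m'\} \not\subseteq A$, but $m' \in A$, then either $m' \in \mu_{\mathcal{Z}}(A)$, or $m \in \mu_{\mathcal{Z}}(A) \subseteq \mu(A)$, but $m \notin Y$. Thus, $(\mu Cum)$ fails for $\mu_1$, too.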

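It remains to show that $\mu$ satisfies $(\mu \subseteq)$, $(\mu Cum)$, $(\mu PR0)$, $(\mu PR1)$. Note that by Fact 14 . 2 . T§1 (page 175)) (3) and Proposition 4.2.25, $\mu_{\mathcal{Z}}$ satisfies $(\mu Cum)$, as $\mathcal{Z}$ is smooth. $(\mu \subseteq)$ is trivial. We show $(\mu PRi)$ for $i = 0, 1$. As $\mu_{\mathcal{Z}}(A) \subseteq \mu(A)$, by $(\mu PR)$ and $(\mu Cum)$ for $\mu_{\mathcal{Z}}$, $\mu_{\mathcal{Z}}(X) \subseteq \mu_0(X)$ and $\mu_{\mathcal{Z}}(X) \subseteq \mu_1(X)$. To see this, we note:

$\mu_{\mathcal{Z}}(X) \subseteq \mu_0(X)$: Let $x \in X - \mu_0(X)$, then there is $Y$ s.t. $x \in Y - \mu(Y)$, $Y \subseteq X$, but $\mu_{\mathcal{Z}}(Y) \subseteq \mu(Y)$, so by $Y \subseteq X$ and $(\mu PR)$ for $\mu_{\mathcal{Z}}$, $x \notin \mu_{\mathcal{Z}}(X)$.

$\mu_{\mathcal{Z}}(X) \subseteq \mu_1(X)$: Let $x \in X - \mu_1(X)$, then there is $Y$ s.t. $x \in Y - \mu(Y)$, $\mu(Y) \subseteq X$, so $x \in Y - \mu_{\mathcal{Z}}(Y)$ and $\mu_{\mathcal{Z}}(Y) \subseteq X$. $\mu_{\mathcal{Z}}(X \cup Y) \subseteq \mu_{\mathcal{Z}}(X) \cup \mu_{\mathcal{Z}}(Y) \subseteq X \subseteq X \cup Y$, so $\mu_{\mathcal{Z}}(X \cup Y) = \mu_{\mathcal{Z}}(X)$ by $(\mu Cum)$ for $\mu_{\mathcal{Z}}$. $x \in Y - \mu_{\mathcal{Z}}(Y) \Rightarrow x \notin \mu_{\mathcal{Z}}(X \cup Y)$ by $(\mu PR)$ for $\mu_{\mathcal{Z}}$, so $x \notin \mu_{\mathcal{Z}}(X)$.

But by Fact 5.4.3 (page 97) (3), $\mu_i(X) \subseteq \mu(X)$. As by definition, $\mu(X) - \mu_{\mathcal{Z}}(X)$ is small, $(\mu PRi)$ hold for $i = 0, 1$. It remains to show $(\mu Cum)$ for $\mu$. Let $\mu(X) \subseteq Y \subseteq X$, then $\mu_{\mathcal{Z}}(X) \subseteq \mu(X) \subseteq Y \subseteq X$, so by $(\mu Cum)$ for $\mu_{\mathcal{Z}}$, $\mu_{\mathcal{Z}}(X) = \mu_{\mathcal{Z}}(Y)$, so by definition of $\mu$, $\mu(X) = \mu(Y)$.

(Note that by Fact 5.4.3 (page 97) (10), $\mu_0 = \mu_1$ follows from $(\mu Cum)$ for $\mu$, so we could have demonstrated part of the properties also differently.)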

□ 



By Fact 5.4.3 (page 97) (3) and (8) and Proposition 4.2.13, $\mu_0$ has a representation by a (transitive) preferential structure, if $\mu : \mathcal{Y} \to \mathcal{Y}$ satisfies $(\mu \subseteq)$ and $(\mu PR0)$, and $\mu_0$ is defined as in Definition 5.4.2 (page 96).

We thus have (taken from [Sch04], Proposition 5.2.5 there):



Proposition 5.4.5 

Let $Z$ be an arbitrary set, $\mathcal{Y} \subseteq \mathcal{P}(Z)$, $\mu : \mathcal{Y} \to \mathcal{Y}$,

(a) If $\mu$ satisfies $(\mu \subseteq)$, $(\mu PR0)$, then there is a transitive preferential structure $\mathcal{Z}$ over $Z$ s.t. for all $U \in \mathcal{Y}$ $\mu(U) = \overbrace{\mu_{\mathcal{Z}}(U)}$.

(b) If $\mathcal{Z}$ is a preferential structure over $Z$ and $\mu : \mathcal{Y} \to \mathcal{Y}$ s.t. for all $U \in \mathcal{Y}$ $\mu(U) = \overbrace{\mu_{\mathcal{Z}}(U)}$, then $\mu$ satisfies $(\mu \subseteq)$, $(\mu PR0)$.

Proof 

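(a) Let $\mu$ satisfy $(\mu \subseteq)$, $(\mu PR0)$. $\mu_0$ as defined in Definition 5.4.2 (page 96) satisfies properties $(\mu \subseteq)$, $(\mu PR)$ by Fact 5.4.3 (page 97) (3) and (8). Thus, by Proposition 4.2.13 (page 67), there is a transitive structure $\mathcal{Z}$ over $Z$ s.t. $\mu_0 = \mu_{\mathcal{Z}}$, but by $(\mu PR0)$ $\mu(U) = \overbrace{\mu_0(U)} = \overbrace{\mu_{\mathcal{Z}}(U)}$ for $U \in \mathcal{Y}$.

(b) $(\mu \subseteq)$: $\mu_{\mathcal{Z}}(U) \subseteq U$, so by $U \in \mathcal{Y}$ $\mu(U) = \overbrace{\mu_{\mathcal{Z}}(U)} \subseteq U$.

$(\mu PR0)$: If $(\mu PR0)$ is false, there is $U \in \mathcal{Y}$ s.t. for $U' := \bigcup\{Y' - \mu(Y') : Y' \in \mathcal{Y}, Y' \subseteq U\}$ $\overbrace{\mu(U) - U'} \subset \mu(U)$. By $\mu_{\mathcal{Z}}(Y') \subseteq \mu(Y')$, $Y' - \mu(Y') \subseteq Y' - \mu_{\mathcal{Z}}(Y')$. No copy of any $x \in Y' - \mu_{\mathcal{Z}}(Y')$ with $Y' \subseteq U$, $Y' \in \mathcal{Y}$ can be minimal in $\mathcal{Z} \lceil U$. Thus, by $\mu_{\mathcal{Z}}(U) \subseteq \mu(U)$, $\mu_{\mathcal{Z}}(U) \subseteq \mu(U) - U'$, so $\overbrace{\mu_{\mathcal{Z}}(U)} \subseteq \overbrace{\mu(U) - U'} \subset \mu(U)$, contradiction.
□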
□ 

We turn to the smooth case. 

If $\mu : \mathcal{Y} \to \mathcal{Y}$ satisfies $(\mu \subseteq)$, $(\mu PR2)$, $(\mu CUM)$ and $\mu_2$ is defined from $\mu$ as in Definition 5.4.2, then $\mu_2$ satisfies $(\mu \subseteq)$, $(\mu PR)$, $(\mu Cum)$ by Fact 5.4.3 (page 97) (3), (8), and (9), and can thus be represented by a (transitive) smooth structure, by Proposition 5.1.1 (page 83), and we finally have (taken from [Sch04], Proposition 5.2.9 there):

Proposition 5.4.6 

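Let $Z$ be an arbitrary set, $\mathcal{Y} \subseteq \mathcal{P}(Z)$, $\mu : \mathcal{Y} \to \mathcal{Y}$, $\mathcal{Y}$ closed under arbitrary intersections and finite unions, and $\emptyset, Z \in \mathcal{Y}$, and let $\overbrace{\ \cdot\ }$ be defined wrt. $\mathcal{Y}$.

(a) If $\mu$ satisfies $(\mu \subseteq)$, $(\mu PR2)$, $(\mu CUM)$, then there is a transitive smooth preferential structure $\mathcal{Z}$ over $Z$ s.t. for all $U \in \mathcal{Y}$ $\mu(U) = \overbrace{\mu_{\mathcal{Z}}(U)}$.

(b) If $\mathcal{Z}$ is a smooth preferential structure over $Z$ and $\mu : \mathcal{Y} \to \mathcal{Y}$ s.t. for all $U \in \mathcal{Y}$ $\mu(U) = \overbrace{\mu_{\mathcal{Z}}(U)}$, then $\mu$ satisfies $(\mu \subseteq)$, $(\mu PR2)$, $(\mu CUM)$.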

Proof 

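(a) If $\mu$ satisfies $(\mu \subseteq)$, $(\mu PR2)$, $(\mu CUM)$, then $\mu_2$ defined from $\mu$ as in Definition 5.4.2 satisfies $(\mu \subseteq)$, $(\mu PR)$, $(\mu CUM)$ by Fact 5.4.3 (page 97) (3), (8) and (9). Thus, by Proposition 5.1.1 (page 83), there is a smooth transitive preferential structure $\mathcal{Z}$ over $Z$ s.t. $\mu_2 = \mu_{\mathcal{Z}}$, but by $(\mu PR2)$ $\mu(U) = \overbrace{\mu_2(U)} = \overbrace{\mu_{\mathcal{Z}}(U)}$.

(b) $(\mu \subseteq)$: $\mu_{\mathcal{Z}}(U) \subseteq U \Rightarrow \mu(U) = \overbrace{\mu_{\mathcal{Z}}(U)} \subseteq U$ by $U \in \mathcal{Y}$.

$(\mu PR2)$: If $(\mu PR2)$ fails, then there is $U \in \mathcal{Y}$ s.t. for $U' := \bigcup\{Y' - \mu(Y') : Y' \in \mathcal{Y}, \mu(U \cup Y') \subseteq U\}$ $\overbrace{\mu(U) - U'} \subset \mu(U)$. By $\mu_{\mathcal{Z}}(Y') \subseteq \mu(Y')$, $Y' - \mu(Y') \subseteq Y' - \mu_{\mathcal{Z}}(Y')$. But no copy of any $x \in Y' - \mu_{\mathcal{Z}}(Y')$ with $\mu_{\mathcal{Z}}(U \cup Y') \subseteq \mu(U \cup Y') \subseteq U$ can be minimal in $\mathcal{Z} \lceil U$: As $x \in Y' - \mu_{\mathcal{Z}}(Y')$, if $(x, i)$ is any copy of $x$, then there is $(y, j) \prec (x, i)$, $y \in Y'$. Consider now $U \cup Y'$. As $(x, i)$ is not minimal in $\mathcal{Z} \lceil U \cup Y'$, by smoothness of $\mathcal{Z}$ there must be $(z, k) \prec (x, i)$, $(z, k)$ minimal in $\mathcal{Z} \lceil U \cup Y'$. But all minimal elements of $\mathcal{Z} \lceil U \cup Y'$ must be in $\mathcal{Z} \lceil U$, so there must be $(z, k) \prec (x, i)$, $z \in U$, thus $(x, i)$ is not minimal in $\mathcal{Z} \lceil U$. Thus by $\mu_{\mathcal{Z}}(U) \subseteq \mu(U)$, $\mu_{\mathcal{Z}}(U) \subseteq \mu(U) - U'$, so $\overbrace{\mu_{\mathcal{Z}}(U)} \subseteq \overbrace{\mu(U) - U'} \subset \mu(U)$, contradiction.

$(\mu CUM)$: Let $\mu(X) \subseteq Y \subseteq X$. Now $\mu_{\mathcal{Z}}(X) \subseteq \overbrace{\mu_{\mathcal{Z}}(X)} = \mu(X)$, so by smoothness of $\mathcal{Z}$ $\mu_{\mathcal{Z}}(Y) = \mu_{\mathcal{Z}}(X)$, thus $\mu(X) = \overbrace{\mu_{\mathcal{Z}}(X)} = \overbrace{\mu_{\mathcal{Z}}(Y)} = \mu(Y)$. □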



5.4.3 Ranked structures 

We recall from [Sch04] and Section 5.1.3 (page 85) above the basic properties of ranked structures.

We give now an easy version of representation results for ranked structures without definability preservation. 

Notation 5.4.1 

We abbreviate $\mu(\{x, y\})$ by $\mu(x, y)$ etc.



Fact 5.4.7 
Let for $\mu : \mathcal{Y} \to \mathcal{Y}$ hold:

$(\mu =)$ for finite sets, $(\mu \in)$, $(\mu PR3)$, $(\mu \emptyset fin)$.

Then the following properties hold for $\mu_3$ as defined in Definition 5.4.2:

(1) $\mu_3(X) \subseteq \mu(X)$,

(2) for finite $X$, $\mu(X) = \mu_3(X)$,

(3) $(\mu \subseteq)$,

(4) $(\mu PR)$,

(5) $(\mu \emptyset fin)$,

(6) $(\mu =)$,

(7) $(\mu \in)$,

(8) $\mu(X) = \overbrace{\mu_3(X)}$.
Proof 

(1) Suppose not, so $x \in \mu_3(X)$, $x \in X - \mu(X)$, so by $(\mu \in)$ for $\mu$, there is $y \in X$, $x \notin \mu(x, y)$, contradiction.

(2) By $(\mu PR3)$ for $\mu$ and (1), for finite $U$ $\mu(U) = \mu_3(U)$.

(3) $(\mu \subseteq)$ is trivial for $\mu_3$.

(4) Let $X \subseteq Y$, $x \in \mu_3(Y) \cap X$, suppose $x \in X - \mu_3(X)$, so there is $y \in X \subseteq Y$, $x \notin \mu(x, y)$, so $x \notin \mu_3(Y)$.

(5) $(\mu \emptyset fin)$ for $\mu_3$ follows from $(\mu \emptyset fin)$ for $\mu$ and (2).

(6) Let $X \subseteq Y$, $y \in \mu_3(Y) \cap X$, $x \in \mu_3(X)$, we have to show $x \in \mu_3(Y)$. By (4), $y \in \mu_3(X)$. Suppose $x \notin \mu_3(Y)$. So there is $z \in Y$, $x \notin \mu(x, z)$. As $y \in \mu_3(Y)$, $y \in \mu(y, z)$. As $x \in \mu_3(X)$, $x \in \mu(x, y)$, as $y \in \mu_3(X)$, $y \in \mu(x, y)$. Consider $\{x, y, z\}$. Suppose $y \notin \mu(x, y, z)$, then by $(\mu \in)$ for $\mu$, $y \notin \mu(x, y)$ or $y \notin \mu(y, z)$, contradiction. Thus $y \in \mu(x, y, z) \cap \mu(x, y)$. As $x \in \mu(x, y)$, and $(\mu =)$ for $\mu$ and finite sets, $x \in \mu(x, y, z)$. Recall that $x \notin \mu(x, z)$. But for finite sets $\mu = \mu_3$, and by (4) $(\mu PR)$ holds for $\mu_3$, so it holds for $\mu$ and finite sets, contradiction.

(7) Let $x \in X - \mu_3(X)$, so there is $y \in X$, $x \notin \mu(x, y) = \mu_3(x, y)$.

(8) As $\mu(X) \in \mathcal{Y}$, and $\mu_3(X) \subseteq \overbrace{\mu_3(X)} \subseteq \mu(X)$, so by $(\mu PR3)$ $\overbrace{\mu_3(X)} = \mu(X)$.
□ 



Fact 5.4.8 

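If $\mathcal{Z}$ is ranked, and we define $\mu(X) := \overbrace{\mu_{\mathcal{Z}}(X)}$, and $\mathcal{Z}$ has no copies, then the following hold:

(1) $\mu_{\mathcal{Z}}(X) = \{x \in X : \forall y \in X.\ x \in \mu(x, y)\}$, so $\mu_{\mathcal{Z}}(X) = \mu_3(X)$ for $X \in \mathcal{Y}$,

(2) $\mu(X) = \mu_{\mathcal{Z}}(X)$ for finite $X$,

(3) $(\mu =)$ for finite sets for $\mu$,

(4) $(\mu \in)$ for $\mu$,

(5) $(\mu \emptyset fin)$ for $\mu$,

(6) $(\mu PR3)$ for $\mu$.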

Proof 

(1) holds for ranked structures.

(2) and (6) are trivial. (3) and (5) hold for $\mu_{\mathcal{Z}}$, so by (2) for $\mu$.

(4) If $x \notin \mu(X)$, then $x \notin \mu_{\mathcal{Z}}(X)$, $(\mu \in)$ holds for $\mu_{\mathcal{Z}}$, so there is $y \in X$ s.t. $x \notin \mu_{\mathcal{Z}}(x, y) = \mu(x, y)$ by (2).
□ 



We summarize: 
Proposition 5.4.9 

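Let $Z$ be an arbitrary set, $\mathcal{Y} \subseteq \mathcal{P}(Z)$, $\mu : \mathcal{Y} \to \mathcal{Y}$, $\mathcal{Y}$ closed under arbitrary intersections and finite unions, contain singletons, and $\emptyset, Z \in \mathcal{Y}$, and let $\overbrace{\ \cdot\ }$ be defined wrt. $\mathcal{Y}$.

(a) If $\mu$ satisfies $(\mu =)$ for finite sets, $(\mu \in)$, $(\mu PR3)$, $(\mu \emptyset fin)$, then there is a ranked preferential structure $\mathcal{Z}$ without copies over $Z$ s.t. for all $U \in \mathcal{Y}$ $\mu(U) = \overbrace{\mu_{\mathcal{Z}}(U)}$.

(b) If $\mathcal{Z}$ is a ranked preferential structure over $Z$ without copies and $\mu : \mathcal{Y} \to \mathcal{Y}$ s.t. for all $U \in \mathcal{Y}$ $\mu(U) = \overbrace{\mu_{\mathcal{Z}}(U)}$, then $\mu$ satisfies $(\mu =)$ for finite sets, $(\mu \in)$, $(\mu PR3)$, $(\mu \emptyset fin)$.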
Proof

(a) Let $\mu$ satisfy $(\mu =)$ for finite sets, $(\mu \in)$, $(\mu PR3)$, $(\mu \emptyset fin)$, then $\mu_3$ as defined in Definition 5.4.2 (page 96) satisfies properties $(\mu \subseteq)$, $(\mu \emptyset fin)$, $(\mu =)$, $(\mu \in)$ by Fact 5.4.7. Thus, by Proposition 5.1.4, there is a transitive structure $\mathcal{Z}$ over $Z$ s.t. $\mu_3 = \mu_{\mathcal{Z}}$, but by Fact 5.4.7 (8) $\mu(U) = \overbrace{\mu_3(U)} = \overbrace{\mu_{\mathcal{Z}}(U)}$ for $U \in \mathcal{Y}$.

(b) This was shown in Fact 5.4.8.
□ 



5.5 The limit variant 
5.5.1 Introduction 

Distance based semantics give perhaps the clearest motivation for the limit variant. For instance, the Stalnaker/Lewis semantics for counterfactual conditionals defines $\phi > \psi$ to hold in a (classical) model $m$ iff in those models of $\phi$ which are closest to $m$, $\psi$ holds. For this to make sense, we need, of course, a distance $d$ on the model set. We call this approach the minimal variant. Usually, one makes a limit assumption: The set of $\phi$-models closest to $m$ is not empty if $\phi$ is consistent - i.e. the $\phi$-models are not arranged around $m$ in a way that they come closer and closer, without a minimal distance. This is, of course, a very strong assumption, and one which is probably difficult to justify philosophically. It seems to have its only justification in the fact that it avoids degenerate cases, where, in above example, for consistent $\phi$, $m \models \phi >$ FALSE holds. As such, this assumption is unsatisfactory.

Our aim here is to analyze the limit version more closely, in particular, to see criteria whether the much more complex limit version can be reduced to the simpler minimal variant. In the limit version, roughly, $\psi$ is a consequence of $\phi$, if $\psi$ holds "in the limit" in all $\phi$-models. That is, iff, "going sufficiently far down", $\psi$ will become and stay true.

The problem is not simple, as there are two sides which come into play, and sometimes we need both to cooperate to 
achieve a satisfactory translation. 

The first component is what we call the "algebraic limit" , i.e. we stipulate that the limit version should have properties 
which correspond to the algebraic properties of the minimal variant. An exact correspondence cannot always be achieved, 
and we give a translation which seems reasonable. 

But once the translation is done, even if it is exact, there might still be problems linked to translation to logic. 

(1) The structural limit: It is a natural and much more convincing solution to the problem described above to modify the basic definition, and work without the rather artificial assumption that the closest world exists. We adopt what we call a "limit approach", and define $m \models \phi > \psi$ iff there is a distance $d'$ such that for all $m' \models \phi$ with $d(m, m') \leq d'$, $m' \models \psi$. Thus, from a certain point onward, $\psi$ becomes and stays true. We will call this definition the structural limit, as it is based directly on the structure (the distance on the model set).

(2) The algebraic limit: The model sets to consider are spheres around $m$, $S := \{m' \in M(\phi) : d(m, m') \leq d'\}$ for some $d'$, s.t. $S \neq \emptyset$. The system of such $S$ is nested, i.e. totally ordered by inclusion; and if $m \models \phi$, it has a smallest element $\{m\}$, etc. When we forget the underlying structure, and consider just the properties of these systems of spheres around different $m$, and for different $\phi$, we obtain what we call the algebraic limit.

(3) The logical limit: The logical limit speaks about the logical properties which hold "in the limit", i.e. finally in all such sphere systems.

The interest to investigate this algebraic limit is twofold: first, we shall see (for other kinds of structures) that there are 
reasonable and not so reasonable algebraic limits. Second, this distinction permits us to separate algebraic from logical 
problems, which have to do with definability of model sets, in short definability problems. We will see that we find common 
definability problems and also common solutions in the usual minimal, and the limit variant. 

In particular, the decomposition into three layers on both sides (minimal and limit version) can reveal that a (seemingly) 
natural notion of structural limit results in algebraic properties which have not much to do any more with the minimal 
variant. So, to speak about a limit variant, we will demand that this variant is not only a natural structural limit, but 
results in a natural abstract limit, too. Conversely, if the algebraic limit preserves the properties of the minimal variant, 
there is hope that it preserves the logical properties, too - not more than hope, however, due to definability problems. 

We give now the basic definitions for preferential and ranked preferential structures. 

Definition 5.5.1 

(1) General preferential structures 

(1.1) The version without copies: 
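Let $\mathcal{M} := \langle U, \prec \rangle$. Define

$Y \subseteq X \subseteq U$ is a minimizing initial segment, or MISE, of $X$ iff:

(a) $\forall x \in X\ \exists y \in Y.\ y \preceq x$ - where $y \preceq x$ stands for $y \prec x$ or $y = x$ (i.e. $Y$ is minimizing) and

(b) $\forall y \in Y, \forall x \in X\ (x \prec y \Rightarrow x \in Y)$ (i.e. $Y$ is downward closed or an initial part).

(1.2) The version with copies:

Let $\mathcal{M} := \langle \mathcal{U}, \prec \rangle$ be as above. Define for $Y \subseteq X \subseteq \mathcal{U}$

$Y$ is a minimizing initial segment, or MISE, of $X$ iff:

(a) $\forall (x, i) \in X\ \exists (y, j) \in Y.\ (y, j) \preceq (x, i)$
and

(b) $\forall (y, j) \in Y, \forall (x, i) \in X\ ((x, i) \prec (y, j) \Rightarrow (x, i) \in Y)$.

(1.3) For $X \subseteq \mathcal{U}$, let $\Lambda(X)$ be the set of MISE of $X$.

(1.4) We say that a set $\mathcal{X}$ of MISE is cofinal in another set of MISE $\mathcal{X}'$ (for the same base set $X$) iff for all $Y' \in \mathcal{X}'$, there is $Y \in \mathcal{X}$, $Y \subseteq Y'$.

(1.5) A MISE $X$ is called definable iff $\{x : \exists i.\ (x, i) \in X\} \in D_{\mathcal{L}}$.

(1.6) $T \models_{\mathcal{M}} \phi$ iff there is $Y \in \Lambda(\mathcal{U} \lceil M(T))$ s.t. $Y \models \phi$.

($\mathcal{U} \lceil M(T) := \{(x, i) \in \mathcal{U} : x \in M(T)\}$ - if there are no copies, we simplify in the obvious way.)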

(2) Ranked preferential structures 

In the case of ranked structures, we may assume without loss of generality that the MISE sets have a particularly simple 
form: 

For $X \subseteq U$, $A \subseteq X$ is MISE iff $X \neq \emptyset$ and $\forall a \in A \forall x \in X (x \prec a \lor x \bot a \Rightarrow x \in A)$. (A is downward and horizontally closed.)

(3) Theory Revision 

Recall that we have a distance d on the model set, and are interested in $y \in Y$ which are close to X.
Thus, given X, Y, we define analogously:
$B \subseteq Y$ is MISE iff

(1) $B \neq \emptyset$

(2) there is $d'$ s.t. $B := \{y \in Y : \exists x \in X. d(x,y) \leq d'\}$ (we could also have chosen $d(x,y) < d'$, this is not important).
And we define $\phi \in T * T'$ iff there is $B \in \Lambda(M(T), M(T'))$ s.t. $B \models \phi$.
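
The distance-based segments of part (3) can be illustrated on a small finite case. The following is only a sketch under stated assumptions: models are bit tuples over three propositional variables, and the distance d is taken to be the Hamming distance. Neither choice comes from the text; the names `hamming` and `revision_segments` are hypothetical.

```python
from itertools import product


def hamming(x, y):
    """Assumed distance: number of variables on which two models differ."""
    return sum(a != b for a, b in zip(x, y))


def revision_segments(X, Y, d=hamming):
    """All nonempty sets B = {y in Y : some x in X has d(x, y) <= t},
    listed for increasing thresholds t, without repetitions."""
    thresholds = sorted({d(x, y) for x in X for y in Y})
    segments = []
    for t in thresholds:
        B = frozenset(y for y in Y if any(d(x, y) <= t for x in X))
        if B and (not segments or segments[-1] != B):
            segments.append(B)
    return segments


models = list(product([0, 1], repeat=3))
X = [(0, 0, 0)]                        # the models of T
Y = [m for m in models if m[0] == 1]   # the models of T': first variable true
segs = revision_segments(X, Y)
for B in segs:
    print(sorted(B))
```

In a finite setting the segments form a chain with a smallest element, so the limit reading and the minimal reading coincide here; the limit version matters only when no closest elements exist.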

5.5.2 The algebraic limit 

There are basic problems with the algebraic limit in general preferential structures. 
Example 5.5.1 

Let $a \prec b$, $a \prec c$, $b \prec d$, $c \prec d$ (but $\prec$ not transitive!), then {a, b} and {a, c} are such S and S', but there is no $S'' \subseteq S \cap S'$
which is an initial segment. If, for instance, in a and b $\phi$ holds, in a and c $\phi'$, then "in the limit" $\phi$ and $\phi'$ will hold, but
not $\phi \land \phi'$. This does not seem right. We should not be obliged to give up $\phi$ to obtain $\phi'$. □
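
The example can be checked mechanically. The sketch below enumerates all minimizing initial segments (version without copies) of the four-point structure by brute force; the function names `is_mise` and `all_mise` and the encoding of the relation as a set of pairs are assumptions made for illustration only.

```python
from itertools import combinations

# Example 5.5.1: a pair (u, v) means u is strictly preferred to v.
# The relation is deliberately not transitive: a < b < d, but not a < d.
U = frozenset("abcd")
PREC = {("a", "b"), ("a", "c"), ("b", "d"), ("c", "d")}


def is_mise(Y, X, prec):
    """Check whether Y is a minimizing initial segment of X."""
    Y, X = frozenset(Y), frozenset(X)
    if not Y <= X:
        return False
    # (a) minimizing: each x in X is in Y or has some y in Y below it
    minimizing = all(x in Y or any((y, x) in prec for y in Y) for x in X)
    # (b) downward closed: whatever lies below an element of Y is in Y
    closed = all(x in Y for y in Y for x in X if (x, y) in prec)
    return minimizing and closed


def all_mise(X, prec):
    elems = sorted(X)
    return {
        frozenset(c)
        for r in range(len(elems) + 1)
        for c in combinations(elems, r)
        if is_mise(c, X, prec)
    }


segments = all_mise(U, PREC)
S, S1 = frozenset("ab"), frozenset("ac")
inside_intersection = [Z for Z in segments if Z <= (S & S1)]
print(sorted("".join(sorted(s)) for s in segments))
print(inside_intersection)
```

The only candidate below both segments is {a}, and it fails to minimize d because a is not below d without transitivity.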



When we look at the system of such S generated by a preferential structure and its algebraic properties, we will therefore
require it to be closed under finite intersections, or at least, that if S, S' are such segments, then there must be $S'' \subseteq S \cap S'$
which is also such a segment.

We make this official. Let $\Lambda(X)$ be the set of initial segments of X, then we require:
($\Lambda\cap$) If $A, B \in \Lambda(X)$ then there is $C \subseteq A \cap B$, $C \in \Lambda(X)$.

More precisely, a limit should be a structural limit in a reasonable sense - whatever the underlying structure is -, and the
resulting algebraic limit should respect ($\Lambda\cap$).

We should not demand too much, either. It would be wrong to demand closure under arbitrary intersections, as this would 
mean that there is an initial segment which makes all consequences true - trivializing the very idea of a limit. 

But we can make our requirements more precise, and bind the limit variant closely to the minimal variant, by looking at 
the algebraic version of both. 

Before we look at deeper problems, we show some basic facts about the algebraic limit. 
Fact 5.5.1 

(Taken from [Sch04], Fact 3.4.3 and Proposition 3.10.16 there.)

Let the relation $\prec$ be transitive. The following hold in the limit variant of general preferential structures:

(1) If $A \in \Lambda(Y)$, and $A \subseteq X \subseteq Y$, then $A \in \Lambda(X)$.

(2) If $A \in \Lambda(Y)$, and $A \subseteq X \subseteq Y$, and $B \in \Lambda(X)$, then $A \cap B \in \Lambda(Y)$.

(3) If $A \in \Lambda(Y)$, $B \in \Lambda(X)$, then there is $Z \subseteq A \cup B$, $Z \in \Lambda(Y \cup X)$.

The following hold in the limit variant of ranked structures without copies, where the domain is closed under finite unions
and contains all finite sets.

(4) $A, B \in \Lambda(X) \Rightarrow A \subseteq B$ or $B \subseteq A$,

(5) $A \in \Lambda(X)$, $Y \subseteq X$, $Y \cap A \neq \emptyset \Rightarrow Y \cap A \in \Lambda(Y)$,

(6) $\Lambda' \subseteq \Lambda(X)$, $\bigcap \Lambda' \neq \emptyset \Rightarrow \bigcap \Lambda' \in \Lambda(X)$.

Proof

(1) trivial. 

(2) 

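(2.1) $A \cap B$ is closed in Y: Let $\langle x,i \rangle \in A \cap B$, $\langle y,j \rangle \prec \langle x,i \rangle$, then $\langle y,j \rangle \in A$. If $\langle y,j \rangle \notin X$, then $\langle y,j \rangle \notin A$, contradiction.
So $\langle y,j \rangle \in X$, but then $\langle y,j \rangle \in B$.

(2.2) $A \cap B$ minimizes Y: Let $\langle a,i \rangle \in Y$.

(a) If $\langle a,i \rangle \in A - B \subseteq X$, then there is $\langle y,j \rangle \prec \langle a,i \rangle$, $\langle y,j \rangle \in B$. By closure of A, $\langle y,j \rangle \in A$.

(b) If $\langle a,i \rangle \notin A$, then there is $\langle a',i' \rangle \in A \subseteq X$, $\langle a',i' \rangle \prec \langle a,i \rangle$, continue by (a).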
(3) 

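Let $Z := \{\langle x,i \rangle \in A : \neg\exists \langle b,j \rangle \preceq \langle x,i \rangle. \langle b,j \rangle \in X - B\} \cup \{\langle y,j \rangle \in B : \neg\exists \langle a,i \rangle \preceq \langle y,j \rangle. \langle a,i \rangle \in Y - A\}$, where $\preceq$ stands for
$\prec$ or $=$.

(3.1) Z minimizes $Y \cup X$: We consider Y, X is symmetrical.

(a) We first show: If $\langle a,k \rangle \in A - Z$, then there is $\langle y,i \rangle \in Z$. $\langle a,k \rangle \succ \langle y,i \rangle$. Proof: If $\langle a,k \rangle \in A - Z$, then there is $\langle b,j \rangle \preceq \langle a,k \rangle$,
$\langle b,j \rangle \in X - B$. Then there is $\langle y,i \rangle \prec \langle b,j \rangle$, $\langle y,i \rangle \in B$. But $\langle y,i \rangle \in Z$, too: If not, there would be $\langle a',k' \rangle \preceq \langle y,i \rangle$,
$\langle a',k' \rangle \in Y - A$, but $\langle a',k' \rangle \prec \langle a,k \rangle$, contradicting closure of A.

(b) If $\langle a'',k'' \rangle \in Y - A$, there is $\langle a,k \rangle \in A$, $\langle a,k \rangle \prec \langle a'',k'' \rangle$. If $\langle a,k \rangle \notin Z$, continue with (a).

(3.2) Z is closed in $Y \cup X$: Let then $\langle z,i \rangle \in Z$, $\langle u,k \rangle \prec \langle z,i \rangle$, $\langle u,k \rangle \in Y \cup X$. Suppose $\langle z,i \rangle \in A$ - the case $\langle z,i \rangle \in B$ is
symmetrical.

(a) $\langle u,k \rangle \in Y - A$ cannot be, by closure of A.

(b) $\langle u,k \rangle \in X - B$ cannot be, as $\langle z,i \rangle \in Z$, and by definition of Z.

(c) If $\langle u,k \rangle \in A - Z$, then there is $\langle v,l \rangle \preceq \langle u,k \rangle$, $\langle v,l \rangle \in X - B$, so $\langle v,l \rangle \prec \langle z,i \rangle$, contradicting (b).

(d) If $\langle u,k \rangle \in B - Z$, then there is $\langle v,l \rangle \preceq \langle u,k \rangle$, $\langle v,l \rangle \in Y - A$, contradicting (a).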

(4) Suppose not, so there are $a \in A - B$, $b \in B - A$. But if $a \bot b$, $a \in B$ and $b \in A$, similarly if $a \prec b$ or $b \prec a$.

(5) As $A \in \Lambda(X)$ and $Y \subseteq X$, $Y \cap A$ is downward and horizontally closed. As $Y \cap A \neq \emptyset$, $Y \cap A$ minimizes Y.

(6) $\bigcap \Lambda'$ is downward and horizontally closed, as all $A \in \Lambda'$ are. As $\bigcap \Lambda' \neq \emptyset$, $\bigcap \Lambda'$ minimizes X.

(7) Set $B := \{b \in Y : \exists a \in A. a \bot b$ or $b \preceq a\}$
□ 



We have as immediate logical consequence: 
Fact 5.5.2 

(Fact 3.4.4 of [Sch04].)

If $\prec$ is transitive, then the following hold in the limit variant:

(1) (AND),

(2) (OR).

Proof

Let $\mathcal{Z}$ be the structure.

(1) Immediate by Fact 5.5.1 (page 102), (2) - set A = B.

(2) Immediate by Fact 5.5.1 (page 102), (3). □



5.5.3 The logical limit 

5.5.3.1 Translation between the minimal and the limit variant 



A good example for problems linked to the translation from the algebraic limit to the logical limit is the property ($\mu =$)
of ranked structures:

($\mu =$) $X \subseteq Y$, $\mu(Y) \cap X \neq \emptyset \Rightarrow \mu(Y) \cap X = \mu(X)$
or its logical form

($\mid\sim =$) $T \vdash T'$, $Con(\overline{\overline{T'}}, T) \Rightarrow \overline{\overline{T}} = \overline{\overline{\overline{T'}} \cup T}$.

$\mu(Y)$ or its analogue $\overline{\overline{T'}}$ (set $X := M(T)$, $Y := M(T')$) speak about the limit, the "ideal", and this, of course, is not what
we have in the limit version. This limit version was introduced precisely to avoid speaking about the ideal.
$\forall B \in \Lambda(Y). B \cap X \neq \emptyset$.

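In logical terms, we have replaced the set of consequences of Y by some Th(B) where $T' \subseteq Th(B) \subseteq \overline{\overline{T'}}$. The conclusion
can now be translated in a similar way to $\forall B \in \Lambda(Y) \exists A \in \Lambda(X). A \subseteq B \cap X$ and $\forall A \in \Lambda(X) \exists B \in \Lambda(Y). B \cap X \subseteq A$. The
total translation reads now:

($\Lambda =$) Let $X \subseteq Y$. Then

$(\forall B \in \Lambda(Y). B \cap X \neq \emptyset) \Rightarrow (\forall B \in \Lambda(Y) \exists A \in \Lambda(X). A \subseteq B \cap X$ and $\forall A \in \Lambda(X) \exists B \in \Lambda(Y). B \cap X \subseteq A)$.

By Fact 5.5.1 (page 102) (5) and (7), we see that this holds in ranked structures. Thus, the limit reading seems to provide
a correct algebraic limit.

Yet, Example 5.5.2 (page 104) below shows the following: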

Let $m' \neq m$ be arbitrary. For $T' := Th(\{m,m'\})$, $T := \emptyset$, we have $T' \vdash T$, $\overline{\overline{T'}} = Th(\{m'\})$, $\overline{\overline{T}} = Th(\{m\})$, $Con(\overline{\overline{T}}, T')$, but

$Th(\{m\}) = \overline{\overline{\overline{T}} \cup T'} \neq \overline{\overline{T'}}$.

Thus:

(1) The prerequisite holds, though usually for $A \in \Lambda(T)$, $A \cap M(T') = \emptyset$.

(2) (PR) fails, which is independent of the prerequisite $Con(\overline{\overline{T}}, T')$, so the problem is not just due to the prerequisite.

(3) Both inclusions of ($\mid\sim =$) fail.

We will see below in Corollary 5.5.6 (page 107) a sufficient condition to make ($\mid\sim =$) hold in ranked structures. It has to do
with definability or formulas, more precisely, the crucial property is to have sufficiently often $\overline{A \cap M(T')} = \overline{A} \cap M(T')$
for $A \in \Lambda(T)$ - see Section 5.4.1 (page 93) for reference.

Example 5.5.2 

(Taken from [Sch04], Example 3.10.1 (1) there.)

Take an infinite propositional language $p_i : i \in \omega$. We have $\omega_1$ models (assume for simplicity CH).

Take the model m which makes all $p_i$ true, and put it on top. Next, going down, take all models which make $p_0$ false, and
then all models which make $p_0$ true, but $p_1$ false, etc. in a ranked construction. So, successively more $p_i$ will become (and
stay) true. Consequently, $\emptyset \models_{\Lambda} p_i$ for all i. But the structure has no minimum, and the "logical" limit m is not in the
set-wise limit. Let $T := \emptyset$ and $m' \neq m$, $T' := Th(\{m, m'\})$, then $\overline{\overline{T}} = Th(\{m\})$, $\overline{\overline{T'}} = Th(\{m'\})$, and $\overline{\overline{\overline{T'}} \cup T} = \overline{\overline{T'}} = Th(\{m'\})$
and $\overline{\overline{\overline{T}} \cup T'} = \overline{\overline{T}} = Th(\{m\})$. □



This example shows that our translation is not perfect, but it is half the way. Note that the minimal variant faces the 
same problems (definability and others), so the problems are probably at least not totally due to our perhaps insufficient 
translation. 

We turn to other rules. 

($\Lambda\cap$) If $A, B \in \Lambda(X)$ then there is $C \subseteq A \cap B$, $C \in \Lambda(X)$

seems a minimal requirement for an appropriate limit. It holds in transitive structures by Fact 5.5.1 (page 102) (2).
The central logical condition for minimal smooth structures is

(CUM) $T \subseteq T' \subseteq \overline{\overline{T}} \Rightarrow \overline{\overline{T}} = \overline{\overline{T'}}$

It would again be wrong - using the limit - to translate this only partly by: If $T \subseteq T' \subseteq \overline{\overline{T}}$, then for all $A \in \Lambda(M(T))$ there
is $B \in \Lambda(M(T'))$ s.t. $A \subseteq B$ - and vice versa. Now, smoothness is in itself a wrong condition for limit structures, as it
speaks about minimal elements, which we will not necessarily have. This cannot guide us. But when we consider a more
modest version of cumulativity, we see what to do.

(CUMfin) If $T \mid\sim \phi$, then $\overline{\overline{T}} = \overline{\overline{T \cup \{\phi\}}}$.

This translates into algebraic limit conditions as follows - where $Y = M(T)$, and $X = M(T \cup \{\phi\})$:
($\Lambda$CUMfin) Let $X \subseteq Y$. If there is $B \in \Lambda(Y)$ s.t. $B \subseteq X$, then:

($\forall A \in \Lambda(X) \exists B' \in \Lambda(Y). B' \subseteq A$ and $\forall B' \in \Lambda(Y) \exists A \in \Lambda(X). A \subseteq B'$).

Note that in this version, we do not have the "ideal" limit on the left of the implication, but one fixed approximation
$B \in \Lambda(Y)$. We can now prove that ($\Lambda$CUMfin) holds in transitive structures: The first part holds by Fact 5.5.1 (page
102) (2), the second, as $B \cap B' \in \Lambda(Y)$ by Fact 5.5.1 (page 102) (1). This is true without additional properties of the
structure, which might at first sight seem surprising. But note that the initial segments play a similar role as the set of
minimal elements: an initial segment has to minimize the other elements, just as the set of minimal elements in the smooth
case does.

The central algebraic property of minimal preferential structures is
($\mu$PR) $X \subseteq Y \Rightarrow \mu(Y) \cap X \subseteq \mu(X)$
This translates naturally and directly to

($\Lambda$PR) $X \subseteq Y \Rightarrow \forall A \in \Lambda(X) \exists B \in \Lambda(Y). B \cap X \subseteq A$

($\Lambda$PR) holds in transitive structures: $Y - X \in \Lambda(Y - X)$, so the result holds by Fact 5.5.1 (page 102) (3).
The central algebraic condition of ranked minimal structures is

($\mu =$) $X \subseteq Y$, $\mu(Y) \cap X \neq \emptyset \Rightarrow \mu(Y) \cap X = \mu(X)$

We saw above how to translate this condition to ($\Lambda =$), we also saw that ($\Lambda =$) holds in ranked structures.
We will see in Corollary 5.5.6 (page 107) that the following logical version holds in ranked structures:

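$T \not\mid\sim \neg\gamma$ implies $\overline{\overline{\overline{T}} \cup \{\gamma\}} = \overline{\overline{T \cup \{\gamma\}}}$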

We generalize above translation results to a recipe: 
Translate 

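(1) $\mu(X) \subseteq \mu(Y)$ to $\forall B \in \Lambda(Y) \exists A \in \Lambda(X). A \subseteq B$, and thus

(2) $\mu(Y) \cap X \subseteq \mu(X)$ to $\forall A \in \Lambda(X) \exists B \in \Lambda(Y). B \cap X \subseteq A$,

(3) $\mu(X) \subseteq Y$ to $\exists A \in \Lambda(X). A \subseteq Y$, and thus

(4) $\mu(Y) \cap X \neq \emptyset$ to $\forall B \in \Lambda(Y). B \cap X \neq \emptyset$

(5) $X \subseteq \mu(Y)$ to $\forall B \in \Lambda(Y). X \subseteq B$,

and quantify expressions separately, thus we repeat:

(6) ($\mu$CUM) $\mu(Y) \subseteq X \subseteq Y \Rightarrow \mu(X) = \mu(Y)$ translates to

(7) ($\Lambda$CUMfin) Let $X \subseteq Y$. If there is $B \in \Lambda(Y)$ s.t. $B \subseteq X$, then:

($\forall A \in \Lambda(X) \exists B' \in \Lambda(Y). B' \subseteq A$ and $\forall B' \in \Lambda(Y) \exists A \in \Lambda(X). A \subseteq B'$).

(8) ($\mu =$) $X \subseteq Y$, $\mu(Y) \cap X \neq \emptyset \Rightarrow \mu(Y) \cap X = \mu(X)$ translates to

(9) ($\Lambda =$) Let $X \subseteq Y$. If $\forall B \in \Lambda(Y). B \cap X \neq \emptyset$, then

($\forall A \in \Lambda(X) \exists B' \in \Lambda(Y). B' \cap X \subseteq A$, and $\forall B' \in \Lambda(Y) \exists A \in \Lambda(X). A \subseteq B' \cap X$).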

We collect now for easier reference the definitions and some algebraic properties which we saw above to hold: 
Definition 5.5.2 

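($\Lambda\cap$) If $A, B \in \Lambda(X)$ then there is $C \subseteq A \cap B$, $C \in \Lambda(X)$,

($\Lambda$PR) $X \subseteq Y \Rightarrow \forall A \in \Lambda(X) \exists B \in \Lambda(Y). B \cap X \subseteq A$,

($\Lambda$CUMfin) Let $X \subseteq Y$. If there is $B \in \Lambda(Y)$ s.t. $B \subseteq X$, then:
($\forall A \in \Lambda(X) \exists B' \in \Lambda(Y). B' \subseteq A$ and $\forall B' \in \Lambda(Y) \exists A \in \Lambda(X). A \subseteq B'$).

($\Lambda =$) Let $X \subseteq Y$. If $\forall B \in \Lambda(Y). B \cap X \neq \emptyset$, then

($\forall A \in \Lambda(X) \exists B' \in \Lambda(Y). B' \cap X \subseteq A$, and $\forall B' \in \Lambda(Y) \exists A \in \Lambda(X). A \subseteq B' \cap X$).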
Fact 5.5.3 

The following hold in transitive structures:

(1) ($\Lambda\cap$)

(2) ($\Lambda$PR)

(3) ($\Lambda$CUMfin)

The following holds in ranked structures:

(4) ($\Lambda =$)

Proof

(1) By Fact 5.5.1 (page 102) (2).

(2) $Y - X \in \Lambda(Y - X)$, so the result holds by Fact 5.5.1 (page 102) (3).

(3) By Fact 5.5.1 (page 102) (1) and (2).

(4) By Fact 5.5.1 (page 102) (5) and (7).
□ 
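
The three conditions for transitive structures can be tested by brute force on small finite examples. The following sketch (version without copies) is an illustration, not part of the proof; the helper names and the choice of test relation, namely the relation of Example 5.5.1 and its transitive closure, are assumptions made here.

```python
from itertools import combinations


def subsets(S):
    elems = sorted(S)
    return [frozenset(c) for r in range(len(elems) + 1) for c in combinations(elems, r)]


def transitive_closure(prec):
    closure = set(prec)
    changed = True
    while changed:
        changed = False
        for (a, b) in list(closure):
            for (c, d) in list(closure):
                if b == c and (a, d) not in closure:
                    closure.add((a, d))
                    changed = True
    return closure


def mise(X, prec):
    """Lambda(X): all minimizing initial segments of X."""
    result = []
    for Y in subsets(X):
        minimizing = all(x in Y or any((y, x) in prec for y in Y) for x in X)
        closed = all(x in Y for y in Y for x in X if (x, y) in prec)
        if minimizing and closed:
            result.append(Y)
    return result


def check_cap(X, prec):
    L = mise(X, prec)
    return all(any(C <= (A & B) for C in L) for A in L for B in L)


def check_pr(X, Y, prec):
    LX, LY = mise(X, prec), mise(Y, prec)
    return all(any((B & X) <= A for B in LY) for A in LX)


def check_cumfin(X, Y, prec):
    LX, LY = mise(X, prec), mise(Y, prec)
    if not any(B <= X for B in LY):
        return True  # the prerequisite fails, so there is nothing to check
    first = all(any(B <= A for B in LY) for A in LX)
    second = all(any(A <= B for A in LX) for B in LY)
    return first and second


U = frozenset("abcd")
RAW = {("a", "b"), ("a", "c"), ("b", "d"), ("c", "d")}  # not transitive
TRANS = transitive_closure(RAW)                          # adds a < d

pairs = [(X, Y) for Y in subsets(U) for X in subsets(Y)]
print(all(check_cap(Y, TRANS) for Y in subsets(U)))
print(all(check_pr(X, Y, TRANS) for X, Y in pairs))
print(all(check_cumfin(X, Y, TRANS) for X, Y in pairs))
print(check_cap(U, RAW))
```

Such a finite check can only confirm the conditions on the structures tried; it says nothing about infinite structures without minimal elements, which are the actual motivation for the limit variant.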
Just as in the minimal case, the algebraic laws may hold, but not the logical ones, due in both cases to definability problems.
Thus, we cannot expect a clean proof of correspondence. But we can argue that we did a correct translation, which shows
its limitation, too. The part with $\mu(X)$ and $\mu(Y)$ on both sides of $\subseteq$ is obvious, we will have a perfect correspondence.
The part with $X \subseteq \mu(Y)$ is obvious, too. The problem is in the part with $\mu(X) \subseteq Y$. As we cannot use the limit, but
only its approximation, we are limited here to one (or finitely many) consequences of T, if $X = M(T)$, so we obtain only
$T \mid\sim \phi$, if $Y \subseteq M(\phi)$, and if there is $A \in \Lambda(X). A \subseteq Y$.

We consider a limit only appropriate, if it is an algebraic limit which preserves algebraic properties of the minimal version 
in above translation. 

The advantage of such limits is that they allow - with suitable caveats - to show that they preserve the logical properties 
of the minimal variant, and thus are equivalent to the minimal case (with, of course, perhaps a different relation). Thus, 
they allow a straightforward trivialization. 

5.5.3.2 Logical properties of the limit variant 

We begin with some simple logical facts about the limit version. 
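We abbreviate $\Lambda(T) := \Lambda(M(T))$ etc., assume transitivity.

Fact 5.5.4

(1) $A \in \Lambda(T) \Rightarrow M(\overline{\overline{T}}) \subseteq \overline{A}$

(2) $M(\overline{\overline{T}}) = \bigcap \{\overline{A} : A \in \Lambda(T)\}$

(2a) $M(\overline{\overline{T'}}) \models \sigma \Rightarrow \exists B \in \Lambda(T'). \overline{B} \models \sigma$

(3) $M(\overline{\overline{T'}}) \cap M(T) \models \sigma \Rightarrow \exists B \in \Lambda(T'). \overline{B} \cap M(T) \models \sigma$.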

Proof 

(1) 

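Note that $A \models \phi \Rightarrow T \mid\sim \phi$ by definition, see Definition 5.5.1 (page 101).

Let $M(\overline{\overline{T}}) \not\subseteq \overline{A}$, so there is $\phi$, $\overline{A} \models \phi$, so $A \models \phi$, but $M(\overline{\overline{T}}) \not\models \phi$, so $T \not\mid\sim \phi$, contradiction.

(2) "$\subseteq$" by (1). "$\supseteq$": Let $x \in \bigcap \{\overline{A} : A \in \Lambda(T)\} \Rightarrow \forall A \in \Lambda(T). x \models Th(A) \Rightarrow x \models \overline{\overline{T}}$.
(2a) $M(\overline{\overline{T'}}) \models \sigma \Rightarrow T' \mid\sim \sigma \Rightarrow \exists B \in \Lambda(T'). B \models \sigma$. But $B \models \sigma \Rightarrow \overline{B} \models \sigma$.

(3) $M(\overline{\overline{T'}}) \cap M(T) \models \sigma \Rightarrow \overline{\overline{T'}} \cup T \vdash \sigma \Rightarrow \exists \tau_1 \ldots \tau_n \in \overline{\overline{T'}}$ s.t. $T \cup \{\tau_1, \ldots, \tau_n\} \vdash \sigma$, so $\exists B \in \Lambda(T'). Th(B) \cup T \vdash \sigma$. So

$M(Th(B)) \cap M(T) \models \sigma \Rightarrow \overline{B} \cap M(T) \models \sigma$.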
□ 



We saw in Example 5.5.2 (page 104) and its discussion the problems which might arise in the limit version, even if the
algebraic behaviour is correct.

This analysis leads us to consider the following facts: 
Fact 5.5.5 



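(1) Let $\forall B \in \Lambda(T') \exists A \in \Lambda(T). A \subseteq B \cap M(T)$, then $\overline{\overline{\overline{T'}} \cup T} \subseteq \overline{\overline{T}}$.

Let, in addition, $\{B \in \Lambda(T') : \overline{B} \cap M(T) = \overline{B \cap M(T)}\}$ be cofinal in $\Lambda(T')$. Then

(2) $Con(\overline{\overline{T'}}, T)$ implies $\forall A \in \Lambda(T'). A \cap M(T) \neq \emptyset$.

(3) $\forall A \in \Lambda(T) \exists B \in \Lambda(T'). B \cap M(T) \subseteq A$ implies $\overline{\overline{T}} \subseteq \overline{\overline{\overline{T'}} \cup T}$.

Note that $M(T) = \overline{M(T)}$, so we could also have written $\overline{B} \cap \overline{M(T)} = \overline{B \cap M(T)}$, but above way of writing stresses more
the essential condition $\overline{X} \cap \overline{Y} = \overline{X \cap Y}$.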

Proof 

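(1) Let $\overline{\overline{T'}} \cup T \vdash \sigma$, so $\exists B \in \Lambda(T'). \overline{B} \cap M(T) \models \sigma$ by Fact 5.5.4 (page 106), (3) above (using compactness). Thus
$\exists A \in \Lambda(T). A \subseteq B \cap M(T) \subseteq \overline{B} \cap M(T)$ by prerequisite, so $A \models \sigma$, so $T \mid\sim \sigma$.

(2) Let $Con(\overline{\overline{T'}}, T)$, so $M(\overline{\overline{T'}}) \cap M(T) \neq \emptyset$. $M(\overline{\overline{T'}}) = \bigcap \{\overline{A} : A \in \Lambda(T')\}$ by Fact 5.5.4 (page 106) (2), so $\forall A \in
\Lambda(T'). \overline{A} \cap M(T) \neq \emptyset$. As cofinally often $\overline{A} \cap M(T) = \overline{A \cap M(T)}$, $\forall A \in \Lambda(T'). \overline{A \cap M(T)} \neq \emptyset$, so $\forall A \in \Lambda(T'). A \cap
M(T) \neq \emptyset$ by $\overline{\emptyset} = \emptyset$.

(3) Let $\sigma \in \overline{\overline{T}}$, so $T \mid\sim \sigma$, so $\exists A \in \Lambda(T). A \models \sigma$, so $\exists B \in \Lambda(T'). B \cap M(T) \subseteq A$ by prerequisite, so $\exists B \in \Lambda(T'). (B \cap M(T) \subseteq A$
and $\overline{B} \cap M(T) = \overline{B \cap M(T)})$. So for such B $\overline{B} \cap M(T) = \overline{B \cap M(T)} \subseteq \overline{A} \models \sigma$. By Fact 5.5.4 (page 106) (1)
$M(\overline{\overline{T'}}) \subseteq \overline{B}$, so $M(\overline{\overline{T'}}) \cap M(T) \models \sigma$, so $\overline{\overline{T'}} \cup T \vdash \sigma$.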
□ 



We obtain now as easy corollaries of a more general situation the following properties shown in [Sch04] by direct proofs.
Thus, we have the trivialization results shown there.

Corollary 5.5.6 

Let the structure be transitive. 

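(1) Let $\{B \in \Lambda(T') : \overline{B} \cap M(T) = \overline{B \cap M(T)}\}$ be cofinal in $\Lambda(T')$, then

(PR) $T \vdash T' \Rightarrow \overline{\overline{T}} \subseteq \overline{\overline{\overline{T'}} \cup T}$ holds.

(2) $\overline{\overline{\phi \land \phi'}} \subseteq \overline{\overline{\overline{\phi}} \cup \{\phi'\}}$ holds.

If the structure is ranked, then also:

(3) Let $\{B \in \Lambda(T') : \overline{B} \cap M(T) = \overline{B \cap M(T)}\}$ be cofinal in $\Lambda(T')$, then

($\mid\sim =$) $T \vdash T'$, $Con(\overline{\overline{T'}}, T) \Rightarrow \overline{\overline{T}} = \overline{\overline{\overline{T'}} \cup T}$ holds.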



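(4) $T' \not\mid\sim \neg\gamma \Rightarrow \overline{\overline{\overline{T'}} \cup \{\gamma\}} = \overline{\overline{T' \cup \{\gamma\}}}$ holds.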
Proof 

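(1) $\forall A \in \Lambda(M(T)) \exists B \in \Lambda(M(T')). B \cap M(T) \subseteq A$ by Fact 5.5.3 (page 105) (2). So the result follows from Fact 5.5.5 (page
106) (3).

(2) Set $T' := \{\phi\}$, $T := \{\phi, \phi'\}$. Then for $B \in \Lambda(T')$ $\overline{B} \cap M(T) = \overline{B} \cap M(\phi') = \overline{B \cap M(\phi')}$ by Fact 2.1.1 (page 28)
($Cl \cap +$), so the result follows by (1).

(3) Let $Con(\overline{\overline{T'}}, T)$, then by Fact 5.5.5 (page 106) (2) $\forall A \in \Lambda(T'). A \cap M(T) \neq \emptyset$, so by Fact 5.5.3 (page 105) (4)
$\forall B \in \Lambda(T') \exists A \in \Lambda(T). A \subseteq B \cap M(T)$, so $\overline{\overline{\overline{T'}} \cup T} \subseteq \overline{\overline{T}}$ by Fact 5.5.5 (page 106) (1).

The other direction follows from (1).

(4) Set $T := T' \cup \{\gamma\}$. Then for $B \in \Lambda(T')$ $\overline{B} \cap M(T) = \overline{B} \cap M(\gamma) = \overline{B \cap M(\gamma)}$ again by Fact 2.1.1 (page 28) ($Cl \cap +$),
so the result follows from (3).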

□ 



We summarize for easier reference here our main positive logical results on the limit variant of general preferential structures
where each model occurs in one copy only, Proposition 3.4.7 and Proposition 3.10.19 from [Sch04]:

Proposition 5.5.7 

Let the relation be transitive. Then 

(1) Every instance of the limit version, where the definable closed minimizing sets are cofinal in the closed minimizing
sets, is equivalent to an instance of the minimal version.

(2) If we consider only formulas on the left of $\mid\sim$, the resulting logic of the limit version can also be generated by the
minimal version of a (perhaps different) preferential structure. Moreover, the structure can be chosen smooth.

Proposition 5.5.8 

When considering just formulas, in the ranked case without copies, $\Lambda$ is equivalent to $\mu$ - so $\Lambda$ is trivialized in this case.
More precisely:

Let a logic $\phi \mid\sim \psi$ be given by the limit variant without copies, i.e. by Definition 5.5.1 (page 101). Then there is a ranked
structure, which gives exactly the same logic, but interpreted in the minimal variant.

(As Example 3.10.2 in [Sch04] has shown, this is NOT necessarily true if we consider full theories T and $T \mid\sim \psi$.)
ourselves to formulas, we are much more shortsighted, and see only a blurred picture. In particular, we can make sequences
of models to converge to some model, but put this model elsewhere. Suitable such manipulations will pass unobserved by 
formulas. The example also shows that there are structures whose limit version for theories is unequal to any minimal 
structure. 

(The negative results for the general not definability preserving minimal case apply also to the general limit case - see
Section 5.2.3 in [Sch04] for details.)



Chapter 6 

Higher preferential structures 



6.1 Introduction 

Definition 6.1.1 

An IBR is called a generalized preferential structure iff the origins of all arrows are points. We will usually write x, y etc.
for points, $\alpha$, $\beta$ etc. for arrows.

Definition 6.1.2 

Consider a generalized preferential structure X. 

(1) Level n arrow: 
Definition by upward induction. 

If $\alpha : x \to y$, x, y are points, then $\alpha$ is a level 1 arrow.

If $\alpha : x \to \beta$, x is a point, $\beta$ a level n arrow, then $\alpha$ is a level n + 1 arrow. ($o(\alpha)$ is the origin, $d(\alpha)$ is the destination of $\alpha$.)
$\lambda(\alpha)$ will denote the level of $\alpha$.

(2) Level n structure:

$\mathcal{X}$ is a level n structure iff all arrows in $\mathcal{X}$ are at most level n arrows.
We consider here only structures of some arbitrary but finite level n.

(3) We define for an arrow $\alpha$ by induction $O(\alpha)$ and $D(\alpha)$.
If $\lambda(\alpha) = 1$, then $O(\alpha) := \{o(\alpha)\}$, $D(\alpha) := \{d(\alpha)\}$.
If $\alpha : x \to \beta$, then $D(\alpha) := D(\beta)$, and $O(\alpha) := \{x\} \cup O(\beta)$.

Thus, for example, if $\alpha : x \to y$, $\beta : z \to \alpha$, then $O(\beta) := \{x, z\}$, $D(\beta) = \{y\}$.
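
The inductive definitions of level, O and D translate directly into recursive functions. The sketch below assumes a hypothetical encoding in which each arrow has a name and is stored as a pair (origin, destination), the destination being either a point or the name of another arrow, and in which point names and arrow names are disjoint.

```python
# name -> (origin, destination); a destination is a point or an arrow name
ARROWS = {
    "alpha": ("x", "y"),       # alpha : x -> y, a level 1 arrow
    "beta": ("z", "alpha"),    # beta : z -> alpha, a level 2 arrow
}


def level(name, arrows):
    """lambda(alpha): 1 for arrows between points, else 1 + level of the target."""
    _, dest = arrows[name]
    return 1 if dest not in arrows else 1 + level(dest, arrows)


def origins(name, arrows):
    """O(alpha): the origin of alpha plus the origins collected from its target."""
    origin, dest = arrows[name]
    return {origin} if dest not in arrows else {origin} | origins(dest, arrows)


def destinations(name, arrows):
    """D(alpha): the point finally reached by following the chain of targets."""
    _, dest = arrows[name]
    return {dest} if dest not in arrows else destinations(dest, arrows)


print(level("beta", ARROWS), origins("beta", ARROWS), destinations("beta", ARROWS))
```

The recursion terminates because only structures of finite level are considered.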

Comment 6.1.1 

A counterargument to $\alpha$ is NOT an argument for $\neg\alpha$ (this is asking for too much), but just showing one case where $\neg\alpha$
holds. In preferential structures, an argument for $\alpha$ is a set of level 1 arrows, eliminating $\neg\alpha$-models. A counterargument
is one level 2 arrow, attacking one such level 1 arrow.

Of course, when we have copies, we may need many successful attacks, on all copies, to achieve the goal. As we may have 
copies of level 1 arrows, we may need many level 2 arrows to destroy them all. 

We will not consider here diagrams with arbitrarily high levels. One reason is that diagrams like the following will have 
an unclear meaning: 

Example 6.1.1 

$\langle \alpha, 1 \rangle : x \to y$,

$\langle \alpha, n + 1 \rangle : x \to \langle \alpha, n \rangle$ ($n \in \omega$).
Is $y \in \mu(X)$?

Definition 6.1.3 

Let $\mathcal{X}$ be a generalized preferential structure of (finite) level n.
We define (by downward induction):
(1) Valid X-to-Y arrow:
Let $X, Y \subseteq P(\mathcal{X})$.

$\alpha \in A(\mathcal{X})$ is a valid X-to-Y arrow iff
(1.1) $O(\alpha) \subseteq X$, $D(\alpha) \subseteq Y$.
We will also say that $\alpha$ is a valid arrow in X, or just valid in X, iff $\alpha$ is a valid X-to-X arrow.

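(2) Valid $X \Rightarrow Y$ arrow:

Let $X \subseteq Y \subseteq P(\mathcal{X})$.

$\alpha \in A(\mathcal{X})$ is a valid $X \Rightarrow Y$ arrow iff

(2.1) $o(\alpha) \in X$, $O(\alpha) \subseteq Y$, $D(\alpha) \subseteq Y$,

(2.2) $\forall \beta : x' \to \alpha. (x' \in Y \Rightarrow \exists \gamma : x'' \to \beta. (\gamma$ is a valid $X \Rightarrow Y$ arrow)).

(Note that in particular $o(\gamma) \in X$, and that $o(\beta)$ need not be in X, but can be in the bigger Y.)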
Fact 6.1.1 

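(1) If $\alpha$ is a valid $X \Rightarrow Y$ arrow, then $\alpha$ is a valid Y-to-Y arrow.

(2) If $X \subseteq X' \subseteq Y' \subseteq Y \subseteq P(\mathcal{X})$ and $\alpha \in A(\mathcal{X})$ is a valid $X \Rightarrow Y$ arrow, and $O(\alpha) \subseteq Y'$, $D(\alpha) \subseteq Y'$, then $\alpha$ is a valid
$X' \Rightarrow Y'$ arrow.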

Proof 

Let $\alpha$ be a valid $X \Rightarrow Y$ arrow. We show (1) and (2) together by downward induction (both are trivial).

By prerequisite $o(\alpha) \in X \subseteq X'$, $O(\alpha) \subseteq Y' \subseteq Y$, $D(\alpha) \subseteq Y' \subseteq Y$.

Case 1: $\lambda(\alpha) = n$. So $\alpha$ is a valid $X' \Rightarrow Y'$ arrow, and a valid Y-to-Y arrow.

Case 2: $\lambda(\alpha) = n - 1$. So there is no $\beta : x' \to \alpha$, $x' \in Y$, so $\alpha$ is a valid Y-to-Y arrow. By $Y' \subseteq Y$ $\alpha$ is a valid $X' \Rightarrow Y'$
arrow.

Case 3: Let the result be shown down to m, n > m > 1, let $\lambda(\alpha) = m - 1$. So $\forall \beta : x' \to \alpha (x' \in Y \Rightarrow \exists \gamma : x'' \to \beta (x'' \in X$
and $\gamma$ is a valid $X \Rightarrow Y$ arrow)). By induction hypothesis $\gamma$ is a valid Y-to-Y arrow, and a valid $X' \Rightarrow Y'$ arrow. So $\alpha$
is a valid Y-to-Y arrow, and by $Y' \subseteq Y$, $\alpha$ is a valid $X' \Rightarrow Y'$ arrow.
□ 



Definition 6.1.4 

Let $\mathcal{X}$ be a generalized preferential structure of level n, $X \subseteq P(\mathcal{X})$.
$\mu(X) := \{x \in X : \exists \langle x,i \rangle. \neg\exists$ valid X-to-X arrow $\alpha : x' \to \langle x,i \rangle\}$.

Comment 6.1.2 

The purpose of smoothness is to guarantee cumulativity. Smoothness achieves cumulativity by mirroring all information
present in X also in $\mu(X)$. Closer inspection shows that smoothness does more than necessary. This is visible when there
are copies (or, equivalently, non-injective labelling functions). Suppose we have two copies of $x \in X$, $\langle x,i \rangle$ and $\langle x,i' \rangle$, and
there is $y \in X$, $\alpha : \langle y,j \rangle \to \langle x,i \rangle$, but there is no $\alpha' : \langle y',j' \rangle \to \langle x,i' \rangle$, $y' \in X$. Then $\alpha : \langle y,j \rangle \to \langle x,i \rangle$ is irrelevant,
as $x \in \mu(X)$ anyhow. So mirroring $\alpha : \langle y,j \rangle \to \langle x,i \rangle$ in $\mu(X)$ is not necessary, i.e. it is not necessary to have some
$\alpha' : \langle y',j' \rangle \to \langle x,i \rangle$, $y' \in \mu(X)$.

On the other hand, Example 6.1.3 (page 113) shows that, if we want smooth structures to correspond to the property
($\mu$CUM), we need at least some valid arrows from $\mu(X)$ also for higher level arrows. This "some" is made precise
(essentially) in Definition 6.1.5 (page 110).

From a more philosophical point of view, when we see the (inverted) arrows of preferential structures as attacks on
non-minimal elements, then we should see smooth structures as always having attacks also from valid (minimal) elements. So,
in general structures, also attacks from non-valid elements are valid, in smooth structures we always also have attacks from
valid elements.

The analogue of usual smooth structures, on level 2, is then that any successfully attacked level 1 arrow is also attacked
from a minimal point.

Definition 6.1.5 

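Let $\mathcal{X}$ be a generalized preferential structure.
$X \sqsubseteq X'$ iff

(1) $X \subseteq X' \subseteq P(\mathcal{X})$,

(2) $\forall x \in X' - X$ $\forall \langle x,i \rangle$ $\exists \alpha : x' \to \langle x,i \rangle$ ($\alpha$ is a valid $X \Rightarrow X'$ arrow),

(3) $\forall x \in X$ $\exists \langle x,i \rangle$

($\forall \alpha : x' \to \langle x,i \rangle (x' \in X' \Rightarrow \exists \beta : x'' \to \alpha. (\beta$ is a valid $X \Rightarrow X'$ arrow))).
Note that (3) is not simply the negation of (2):

Consider a level 1 structure. Thus all level 1 arrows are valid, but the source of the arrows must not be neglected.

(2) reads now: $\forall x \in X' - X$ $\forall \langle x,i \rangle$ $\exists \alpha : x' \to \langle x,i \rangle. x' \in X$

(3) reads: $\forall x \in X$ $\exists \langle x,i \rangle$ $\neg\exists \alpha : x' \to \langle x,i \rangle. x' \in X'$
Remark 6.1.2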

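We note the special case of Definition 6.1.5 (page 110) for level 3 structures, as it will be used later. We also write it
immediately for the intended case $\mu(X) \sqsubseteq X$, and explicitly with copies.

$x \in \mu(X)$ iff

(1) $\exists \langle x,i \rangle \forall \langle \alpha,k \rangle : \langle y,j \rangle \to \langle x,i \rangle$

($y \in X \Rightarrow \exists \langle \beta',l' \rangle : \langle z',m' \rangle \to \langle \alpha,k \rangle.$

($z' \in \mu(X) \land \neg\exists \langle \gamma',n' \rangle : \langle u',p' \rangle \to \langle \beta',l' \rangle. u' \in X$))

See Diagram 6.1.1 (page 111).
$x \in X - \mu(X)$ iff

(2) $\forall \langle x,i \rangle \exists \langle \alpha',k' \rangle : \langle y',j' \rangle \to \langle x,i \rangle$

($y' \in \mu(X) \land$

(a) $\neg\exists \langle \beta',l' \rangle : \langle z',m' \rangle \to \langle \alpha',k' \rangle. z' \in X$
or

(b) $\forall \langle \beta',l' \rangle : \langle z',m' \rangle \to \langle \alpha',k' \rangle$

($z' \in X \Rightarrow \exists \langle \gamma',n' \rangle : \langle u',p' \rangle \to \langle \beta',l' \rangle. u' \in \mu(X)$) )
See Diagram 6.1.2 (page 112).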




Diagram 6.1.1: Case 3-1-2

Diagram 6.1.2: Case 3-2



Fact 6.1.3 

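(1) If $X \sqsubseteq X'$, then $X = \mu(X')$,

(2) $X \sqsubseteq X'$, $X \subseteq X'' \subseteq X' \Rightarrow X \sqsubseteq X''$. (This corresponds to ($\mu$CUM).)

(3) $X \sqsubseteq X'$, $X \subseteq Y'$, $Y \sqsubseteq Y'$, $Y \subseteq X' \Rightarrow X = Y$. (This corresponds to ($\mu \subseteq\supseteq$).)

Proof

(1) Trivial by Fact 6.1.1 (page 110) (1).
(2)

We have to show

(a) $\forall x \in X'' - X$ $\forall \langle x,i \rangle$ $\exists \alpha : x' \to \langle x,i \rangle$ ($\alpha$ is a valid $X \Rightarrow X''$ arrow), and

(b) $\forall x \in X$ $\exists \langle x,i \rangle$ ($\forall \alpha : x' \to \langle x,i \rangle (x' \in X'' \Rightarrow \exists \beta : x'' \to \alpha. (\beta$ is a valid $X \Rightarrow X''$ arrow))).

Both follow from the corresponding condition for $X \sqsubseteq X'$, the restriction of the universal quantifier, and Fact 6.1.1 (page
110) (2).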

(3) 

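Let $x \in X - Y$.

(a) By $x \in X \sqsubseteq X'$, $\exists \langle x,i \rangle$ s.t. ($\forall \alpha : x' \to \langle x,i \rangle (x' \in X' \Rightarrow \exists \beta : x'' \to \alpha. (\beta$ is a valid $X \Rightarrow X'$ arrow))).

(b) By $x \notin Y \sqsubseteq Y'$ $\exists \alpha_1 : x' \to \langle x,i \rangle$, $\alpha_1$ is a valid $Y \Rightarrow Y'$ arrow, in particular $x' \in Y \subseteq X'$. Moreover, $\lambda(\alpha_1) = 1$.
So by (a) $\exists \beta_2 : x'' \to \alpha_1. (\beta_2$ is a valid $X \Rightarrow X'$ arrow), in particular $x'' \in X \subseteq Y'$, moreover $\lambda(\beta_2) = 2$.

It follows by induction from the definition of valid $A \Rightarrow B$ arrows that
$\forall m \exists \alpha_{2m+1}$, $\lambda(\alpha_{2m+1}) = 2m + 1$, $\alpha_{2m+1}$ a valid $Y \Rightarrow Y'$ arrow and
$\forall m \exists \beta_{2m+2}$, $\lambda(\beta_{2m+2}) = 2m + 2$, $\beta_{2m+2}$ a valid $X \Rightarrow X'$ arrow,
which is impossible, as $\mathcal{X}$ is a structure of finite level.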
□ 



Definition 6.1.6
$\mathcal{X}$ is called totally smooth for X iff

(1) $\forall \alpha : x \to y \in A(\mathcal{X}) (O(\alpha) \cup D(\alpha) \subseteq X \Rightarrow \exists \alpha' : x' \to y. x' \in \mu(X))$

(2) if $\alpha$ is valid, then there must also exist such $\alpha'$ which is valid,
(y a point or an arrow).

If $\mathcal{Y} \subseteq P(\mathcal{X})$, then $\mathcal{X}$ is called $\mathcal{Y}$-totally smooth iff for all $X \in \mathcal{Y}$ $\mathcal{X}$ is totally smooth for X.
Example 6.1.2 

$\mathcal{X} := \{\alpha : a \to b, \alpha' : b \to c, \alpha'' : a \to c, \beta : b \to \alpha'\}$ is not totally smooth,

$\mathcal{X} := \{\alpha : a \to b, \alpha' : b \to c, \alpha'' : a \to c, \beta : b \to \alpha', \beta' : a \to \alpha'\}$ is totally smooth.

Example 6.1.3 

Consider $\alpha' : a \to b$, $\alpha'' : b \to c$, $\alpha : a \to c$, $\beta : a \to \alpha$.

Then $\mu(\{a, b, c\}) = \{a\}$, $\mu(\{a, c\}) = \{a, c\}$. Thus, ($\mu$CUM) does not hold in this structure. Note that there is no valid
arrow from $\mu(\{a, b, c\})$ to c.
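
The two values of $\mu$ in this example can be recomputed mechanically. The sketch below works without copies and rests on an assumed reading of validity that is not spelled out in the text above: an arrow is taken to be a valid X-to-Y arrow iff its origins lie in X, its destinations lie in Y, and every attack on it starting in X is itself attacked by a valid X-to-Y arrow. The encoding of arrows and all function names are hypothetical.

```python
# name -> (origin, destination); a destination is a point or an arrow name
ARROWS = {
    "alpha1": ("a", "b"),      # alpha'  : a -> b
    "alpha2": ("b", "c"),      # alpha'' : b -> c
    "alpha": ("a", "c"),       # alpha   : a -> c
    "beta": ("a", "alpha"),    # beta    : a -> alpha, a level 2 arrow
}


def origins(name, arrows):
    origin, dest = arrows[name]
    return {origin} if dest not in arrows else {origin} | origins(dest, arrows)


def destinations(name, arrows):
    _, dest = arrows[name]
    return {dest} if dest not in arrows else destinations(dest, arrows)


def attackers(target, arrows):
    """Names of all arrows whose destination is the given point or arrow."""
    return [n for n, (_, dest) in arrows.items() if dest == target]


def is_valid(name, X, Y, arrows):
    """Assumed reading of 'valid X-to-Y arrow', see the remarks above."""
    if not (origins(name, arrows) <= X and destinations(name, arrows) <= Y):
        return False
    for beta in attackers(name, arrows):
        if arrows[beta][0] in X:
            # beta attacks from inside X, so it must be countered by a valid arrow
            if not any(is_valid(g, X, Y, arrows) for g in attackers(beta, arrows)):
                return False
    return True


def mu(X, arrows):
    """Points of X that are not the target of any arrow valid in X."""
    X = set(X)
    return {x for x in X
            if not any(is_valid(al, X, X, arrows) for al in attackers(x, arrows))}


print(sorted(mu("abc", ARROWS)), sorted(mu("ac", ARROWS)))
```

In {a, c} the arrow from a to c is destroyed by the level 2 attack, and the arrow from b to c is unavailable because b is missing, so c becomes minimal although it was not minimal in {a, b, c}.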

Definition 6.1.7 

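Let $\mathcal{X}$ be a generalized preferential structure, $X \subseteq P(\mathcal{X})$.
$\mathcal{X}$ is called essentially smooth for X iff $\mu(X) \sqsubseteq X$.

If $\mathcal{Y} \subseteq P(\mathcal{X})$, then $\mathcal{X}$ is called $\mathcal{Y}$-essentially smooth iff for all $X \in \mathcal{Y}$ $\mu(X) \sqsubseteq X$.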
Example 6.1.4 

It is easy to see that we can distinguish total and essential smoothness in richer structures, as the following Example shows: 
We add an accessibility relation R, and consider only those models which are accessible. 

Let e.g. $a \to b \to \langle c, 0 \rangle, \langle c, 1 \rangle$, without transitivity. Thus, only c has two copies. This structure is essentially smooth, but
of course not totally so.

Let now $mRa$, $mRb$, $mR\langle c,0 \rangle$, $mR\langle c,1 \rangle$, $m'Ra$, $m'Rb$, $m'R\langle c,0 \rangle$.

Thus, seen from m, $\mu(\{a, b, c\}) = \{a, c\}$, but seen from m', $\mu(\{a, b, c\}) = \{a\}$, but $\mu(\{a, c\}) = \{a, c\}$, contradicting (CUM).
□ 



6.2 The general case 



The idea to solve the representation problem illustrated by Example 2.3.2 (page 35) is to use the points c and d as bases
for counterarguments against $\alpha : b \to a$ - as is possible in IBRS. We do this now. We will obtain a representation for logics
weaker than P by generalized preferential structures.

We will now prove a representation theorem, but will make it more general than for preferential structures only. For this 
purpose, we will introduce some definitions first. 

Definition 6.2.1 

Let $\eta, \rho : \mathcal{Y} \to \mathcal{P}(U)$.

(1) If $\mathcal{X}$ is a simple structure:

$\mathcal{X}$ is called an attacking structure relative to $\eta$ representing $\rho$ iff
$\rho(X) = \{x \in \eta(X)$ : there is no valid X-to-$\eta(X)$ arrow $\alpha : x' \to x\}$
for all $X \in \mathcal{Y}$.

(2) If $\mathcal{X}$ is a structure with copies:

$\mathcal{X}$ is called an attacking structure relative to $\eta$ representing $\rho$ iff

$\rho(X) = \{x \in \eta(X)$ : there is $\langle x,i \rangle$ and no valid X-to-$\eta(X)$ arrow $\alpha : \langle x',i' \rangle \to \langle x,i \rangle\}$
for all $X \in \mathcal{Y}$.

Obviously, in those cases $\rho(X) \subseteq \eta(X)$ for all $X \in \mathcal{Y}$.
Thus, $\mathcal{X}$ is a preferential structure iff $\eta$ is the identity.
See Diagram 6.2.1 (page 113)

Diagram 6.2.1: Attacking structure



(Note that it does not seem very useful to generalize the notion of smoothness from preferential structures to general
attacking structures, as, in the general case, the minimizing set X and the result $\rho(X)$ may be disjoint.)

The following result is the first positive representation result of this paper, and shows that we can obtain (almost) anything 
with level 2 structures. 

Proposition 6.2.1 

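Let $\eta, \rho : \mathcal{Y} \to \mathcal{P}(U)$. Then there is an attacking level 2 structure relative to $\eta$ representing $\rho$ iff

(1) $\rho(X) \subseteq \eta(X)$ for all $X \in \mathcal{Y}$,

(2) $\rho(\emptyset) = \eta(\emptyset)$ if $\emptyset \in \mathcal{Y}$.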

(2) is, of course, void for preferential structures. 
Proof 

(A) The construction 

We make a two stage construction. 

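(A.1) Stage 1.

In stage one, consider (almost as usual)
$\mathcal{U} := \langle \mathcal{X}, \{\alpha_i : i \in I\} \rangle$ where

$\mathcal{X} := \{\langle x,f \rangle : x \in U, f \in \Pi\{X \in \mathcal{Y} : x \in \eta(X) - \rho(X)\}\}$,

$\alpha : x' \to \langle x,f \rangle :\Leftrightarrow x' \in ran(f)$. Attention: $x' \in X$, not $x' \in \rho(X)$!

(A.2) Stage 2.

Let $\mathcal{X}'$ be the set of all $\langle x,f,X \rangle$ s.t. $\langle x,f \rangle \in \mathcal{X}$ and

(a) either X is some dummy value, say *
or

(b) all of the following (1) - (4) hold:

(1) $X \in \mathcal{Y}$,

(2) $x \in \rho(X)$,

(3) there is $X' \subseteq X$, $x \in \eta(X') - \rho(X')$, $X' \in \mathcal{Y}$, (thus $ran(f) \cap X \neq \emptyset$ by definition),

(4) $\forall X'' \in \mathcal{Y}. (X \subseteq X'', x \in \eta(X'') - \rho(X'') \Rightarrow (ran(f) \cap X'') - X \neq \emptyset)$.

(Thus, f chooses in (4) for X'' also outside X. If there is no such X'', (4) is void, and only (1) - (3) need to hold, i.e. we
may take any f with $\langle x,f \rangle \in \mathcal{X}$.)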




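Note: If (1) - (3) are satisfied for x and X, then we will find f s.t. $\langle x,f \rangle \in \mathcal{X}$, and $\langle x,f,X \rangle$ satisfies (1) - (4): As $X \subsetneq X''$
for X'' as in (4), we find f which chooses for such X'' outside of X.

So for any $\langle x,f \rangle \in \mathcal{X}$, there is $\langle x,f,* \rangle$, and maybe also some $\langle x,f,X \rangle$ in $\mathcal{X}'$.

Let again for any x', $\langle x,f,X \rangle \in \mathcal{X}'$

$\alpha : x' \to \langle x,f,X \rangle :\Leftrightarrow x' \in ran(f)$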

(A.3) Adding arrows.

Consider x' and $\langle x,f,X \rangle$.

If X = *, or $x' \notin X$, we do nothing, i.e. leave a simple arrow $\alpha : x' \to \langle x,f,X \rangle \Leftrightarrow x' \in ran(f)$.

If $X \in \mathcal{Y}$, and $x' \in X$, and $x' \in ran(f)$, we make X many copies of the attacking arrow and have then: $\langle \alpha,x'' \rangle : x' \to
\langle x,f,X \rangle$ for all $x'' \in X$.

In addition, we add attacks on the $\langle \alpha,x'' \rangle$: $\langle \beta,x'' \rangle : x'' \to \langle \alpha,x'' \rangle$ for all $x'' \in X$.
The full structure $\mathcal{Z}$ is thus:
$\mathcal{X}'$ is the set of elements.

If $x' \in ran(f)$, and X = * or $x' \notin X$ then $\alpha : x' \to \langle x,f,X \rangle$
if $x' \in ran(f)$, and $X \neq *$ and $x' \in X$ then

(a) $\langle \alpha,x'' \rangle : x' \to \langle x,f,X \rangle$ for all $x'' \in X$,

(b) $\langle \beta,x'' \rangle : x'' \to \langle \alpha,x'' \rangle$ for all $x'' \in X$.
See Diagram 6.2.3 (page 115).

Diagram 6.2.3: Attacking structure

(B) Representation 

We have to show that this structure represents $\rho$ relative to $\eta$.
Let $y \in \eta(Y)$, $Y \in \mathcal{Y}$.
Case 1. $y \in \rho(Y)$.

We have to show that there is $\langle y,g,Y'' \rangle$ s.t. there is no valid $\alpha : y' \to \langle y,g,Y'' \rangle$, $y' \in Y$. In Case 1.1 below, Y'' will be *,
in Case 1.2, Y'' will be Y, g will be chosen suitably.

Case 1.1. There is no $Y' \subsetneq Y$, $y \in \eta(Y') - \rho(Y')$, $Y' \in \mathcal{Y}$.

So for all Y' with $y \in \eta(Y') - \rho(Y')$ $Y' - Y \neq \emptyset$. Let $g \in \Pi\{Y' - Y : y \in \eta(Y') - \rho(Y')\}$. Then $ran(g) \cap Y = \emptyset$, and $\langle y,g,* \rangle$
is not attacked from Y. ($\langle y,g \rangle$ was already not attacked in $\mathcal{X}$.)

Case 1.2. There is $Y' \subsetneq Y$, $y \in \eta(Y') - \rho(Y')$, $Y' \in \mathcal{Y}$.

Let now $\langle y,g,Y \rangle \in \mathcal{X}'$, s.t. $g(Y'') \notin Y$ if $Y \subseteq Y''$, $y \in \eta(Y'') - \rho(Y'')$, $Y'' \in \mathcal{Y}$. As noted above, such g and thus $\langle y,g,Y \rangle$
exist. Fix $\langle y,g,Y \rangle$.

Consider any $y' \in ran(g)$. If $y' \notin Y$, y' does not attack $\langle y,g,Y \rangle$ in Y. Suppose $y' \in Y$. We had made Y many copies $\langle \alpha,y'' \rangle$,
$y'' \in Y$ with $\langle \alpha,y'' \rangle : y' \to \langle y,g,Y \rangle$ and had added the level 2 arrows $\langle \beta,y'' \rangle : y'' \to \langle \alpha,y'' \rangle$ for $y'' \in Y$. So all copies $\langle \alpha,y'' \rangle$
are destroyed in Y. This was done for all $y' \in Y$, $y' \in ran(g)$, so $\langle y,g,Y \rangle$ is now not (validly) attacked in Y any more.

Case 2. $y \in \eta(Y) - \rho(Y)$.

Let $\langle y, g, Y'\rangle$ (where $Y'$ can be $*$) be any copy of $y$, we have to show that there is $z \in Y$, $\alpha : z \to \langle y, g, Y'\rangle$, or some
$\langle\alpha, z'\rangle : z \to \langle y, g, Y'\rangle$, $z' \in Y'$, which is not destroyed by some level 2 arrow $\langle\beta, z'\rangle : z' \to \langle\alpha, z'\rangle$, $z' \in Y$.

As $y \in \eta(Y) - \rho(Y)$, $ran(g) \cap Y \neq \emptyset$, so there is $z \in ran(g) \cap Y$. Fix such $z$. (We will modify the choice of $z$ only in Case
2.2.2 below.)

Case 2.1. $Y' = *$.

As $z \in ran(g)$, $\alpha : z \to \langle y, g, *\rangle$. (There were no level 2 arrows introduced for this copy.)
Case 2.2. $Y' \neq *$.

So $\langle y, g, Y'\rangle$ satisfies the conditions (1) - (4) of (b) at the beginning of the proof.

If $z \notin Y'$, we are done, as $\alpha : z \to \langle y, g, Y'\rangle$, and there were no level 2 arrows introduced in this case. If $z \in Y'$, we had made
$Y'$ many copies $\langle\alpha, z'\rangle$, $\langle\alpha, z'\rangle : z \to \langle y, g, Y'\rangle$, one for each $z' \in Y'$. Each $\langle\alpha, z'\rangle$ was destroyed by $\langle\beta, z'\rangle : z' \to \langle\alpha, z'\rangle$,
$z' \in Y'$.

Case 2.2.1. $Y' \not\subseteq Y$.

Let $z'' \in Y' - Y$, then $\langle\alpha, z''\rangle : z \to \langle y, g, Y'\rangle$ is destroyed only by $\langle\beta, z''\rangle : z'' \to \langle\alpha, z''\rangle$ in $Y'$, but not in $Y$, as $z'' \notin Y$, so
$\langle y, g, Y'\rangle$ is attacked by $\langle\alpha, z''\rangle : z \to \langle y, g, Y'\rangle$, valid in $Y$.

Case 2.2.2. $Y' \subsetneq Y$ ($Y = Y'$ is impossible, as $y \in \rho(Y')$, $y \notin \rho(Y)$).

Then there was by definition (condition (b) (4)) some $z' \in (ran(g) \cap Y) - Y'$ and $\alpha : z' \to \langle y, g, Y'\rangle$ is valid, as $z' \notin Y'$.
(In this case, there are no copies of $\alpha$ and no level 2 arrows.)



Corollary 6.2.2

(1) We cannot distinguish general structures of level 2 from those of higher levels by their $\rho$-functions relative to $\eta$.

(2) Let $U$ be the universe, $\mathcal{Y} \subseteq \mathcal{P}(U)$, $\mu : \mathcal{Y} \to \mathcal{P}(U)$. Then any $\mu$ satisfying $(\mu \subseteq)$ can be represented by a level 2
preferential structure. (Choose $\eta =$ identity.)

Again, we cannot distinguish general structures of level 2 from those of higher levels by their $\mu$-functions.

□ 



A remark on the function $\eta$:

We can also obtain the function $\eta$ via arrows. Of course, then we need positive arrows (not only negative arrows against
negative arrows, as we first need to have something positive).

If $\eta$ is the identity, we can make a positive arrow from each point to itself. Otherwise, we can connect every point to every
point by a positive arrow, and then choose those we really want in $\eta$ by a choice function obtained from arrows just as we
obtained $\rho$ from arrows.

6.3 Discussion of the totally smooth case 

Fact 6.3.1 

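Let $X, Y \in \mathcal{Y}$, $\mathcal{X}$ a level $n$ structure. Let $\langle\alpha, k\rangle : \langle x, i\rangle \to \langle y, j\rangle$, where $\langle y, j\rangle$ may itself be (a copy of) an arrow.

(1) Let $n > 1$, $X \subseteq Y$, $\langle\alpha, k\rangle \in \mathcal{X}$ a level $n-1$ arrow in $\mathcal{X}\lceil X$. If $\langle\alpha, k\rangle$ is valid in $\mathcal{X}\lceil Y$, then it is valid in $\mathcal{X}\lceil X$.

(2) Let $\mathcal{X}$ be totally smooth, $\mu(X) \subseteq Y$, $\mu(Y) \subseteq X$, $\langle\alpha, k\rangle \in \mathcal{X}\lceil X \cap Y$, then $\langle\alpha, k\rangle$ is valid in $\mathcal{X}\lceil X$ iff it is valid in $\mathcal{X}\lceil Y$.
Note that we will also sometimes write $X$ for $\mathcal{X}\lceil X$, when the context is clear.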

Proof 

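(1) If $\langle\alpha, k\rangle$ is not valid in $\mathcal{X}\lceil X$, then there must be a level $n$ arrow $\langle\beta, r\rangle : \langle z, s\rangle \to \langle\alpha, k\rangle$ in $\mathcal{X}\lceil X \subseteq \mathcal{X}\lceil Y$. $\langle\beta, r\rangle$ must be
valid in $\mathcal{X}\lceil X$ and $\mathcal{X}\lceil Y$, as there are no level $n+1$ arrows. So $\langle\alpha, k\rangle$ is not valid in $\mathcal{X}\lceil Y$, contradiction.

(2) By downward induction. Case $n$: $\langle\alpha, k\rangle \in \mathcal{X}\lceil X \cap Y$, so it is valid in both as there are no level $n+1$ arrows. Case
$m \to m-1$: Let $\langle\alpha, k\rangle \in \mathcal{X}\lceil X \cap Y$ be a level $m-1$ arrow valid in $\mathcal{X}\lceil X$, but not in $\mathcal{X}\lceil Y$. So there must be a level $m$
arrow $\langle\beta, r\rangle : \langle z, s\rangle \to \langle\alpha, k\rangle$ valid in $\mathcal{X}\lceil Y$. By total smoothness, we may assume $z \in \mu(Y) \subseteq X$, so $\langle\beta, r\rangle \in \mathcal{X}\lceil X$ is valid
by induction hypothesis. So $\langle\alpha, k\rangle$ is not valid in $\mathcal{X}\lceil X$, contradiction.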

□ 



Corollary 6.3.2 

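Let $X, Y \in \mathcal{Y}$, $\mathcal{X}$ a totally smooth level $n$ structure, $\mu(X) \subseteq Y$, $\mu(Y) \subseteq X$. Then $\mu(X) = \mu(Y)$.
Proof

Let $x \in \mu(X) - \mu(Y)$. Then by $\mu(X) \subseteq Y$, $x \in Y$, so there must be for all $\langle x, i\rangle \in \mathcal{X}$ an arrow $\langle\alpha, k\rangle : \langle y, j\rangle \to \langle x, i\rangle$ valid
in $\mathcal{X}\lceil Y$, wlog. $y \in \mu(Y) \subseteq X$ by total smoothness. So by Fact 6.3.1 (2), $\langle\alpha, k\rangle$ is valid in $\mathcal{X}\lceil X$. This holds for
all $\langle x, i\rangle$, so $x \notin \mu(X)$, contradiction. □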



Fact 6.3.3 

There are situations satisfying $(\mu \subseteq) + (\mu CUM) + (\cap)$ which cannot be represented by level 2 totally smooth preferential
structures.

The proof is given in the following example. 
Example 6.3.1 

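Let $Y := \{x, y, y'\}$, $X := \{x, y\}$, $X' := \{x, y'\}$. Let $\mathcal{Y} := \mathcal{P}(Y)$. Let $\mu(Y) := \{y, y'\}$, $\mu(X) := \mu(X') := \{x\}$, and $\mu(Z) := Z$
for all other sets.

Obviously, this satisfies $(\cap)$, $(\mu \subseteq)$, and $(\mu CUM)$.

Suppose $\mathcal{X}$ is a totally smooth level 2 structure representing $\mu$.

So $\mu(X) = \mu(X') \subseteq Y - \mu(Y)$, $\mu(Y) \subseteq X \cup X'$. Let $\langle x, i\rangle$ be minimal in $\mathcal{X}\lceil X$. As $\langle x, i\rangle$ cannot be minimal in $\mathcal{X}\lceil Y$, there
must be $\alpha : \langle z, j\rangle \to \langle x, i\rangle$, valid in $\mathcal{X}\lceil Y$.

Case 1: $z \in X'$.

So $\alpha \in \mathcal{X}\lceil X'$. If $\alpha$ is valid in $\mathcal{X}\lceil X'$, there must be $\alpha' : \langle x', i'\rangle \to \langle x, i\rangle$, $x' \in \mu(X')$, valid in $\mathcal{X}\lceil X'$, and thus in $\mathcal{X}\lceil X$, by
$\mu(X) = \mu(X')$ and Fact 6.3.1 (2). This is impossible, so there must be $\beta : \langle x', i'\rangle \to \alpha$, $x' \in \mu(X')$, valid in
$\mathcal{X}\lceil X'$. As $\beta$ is in $\mathcal{X}\lceil Y$ and $\mathcal{X}$ a level $\leq 2$ structure, $\beta$ is valid in $\mathcal{X}\lceil Y$, so $\alpha$ is not valid in $\mathcal{X}\lceil Y$, contradiction.

Case 2: $z \in X$.

$\alpha$ cannot be valid in $\mathcal{X}\lceil X$, so there must be $\beta : \langle x', i'\rangle \to \alpha$, $x' \in \mu(X)$, valid in $\mathcal{X}\lceil X$. Again, as $\beta$ is in $\mathcal{X}\lceil Y$ and $\mathcal{X}$ a level
$\leq 2$ structure, $\beta$ is valid in $\mathcal{X}\lceil Y$, so $\alpha$ is not valid in $\mathcal{X}\lceil Y$, contradiction.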

□ 



It is unknown to the authors whether an analogue is true for essential smoothness, i.e. whether there are examples of such
$\mu$ functions which need at least level 3 essentially smooth structures for representation. Proposition 6.4.2 below
shows that such structures suffice, but we do not know whether level 3 is necessary.



Fact 6.3.4

Example 6.3.1 above can be solved by a totally smooth level 3 structure:

Let $\alpha_1 : x \to y$, $\alpha_2 : x \to y'$, $\alpha_3 : y \to x$, $\beta_1 : y \to \alpha_2$, $\beta_2 : y' \to \alpha_1$, $\beta_3 : y \to \alpha_3$, $\beta_4 : x \to \alpha_3$, $\gamma_1 : y' \to \beta_3$, $\gamma_2 : y' \to \beta_4$.
See Diagram 6.3.1.

The subdiagram generated by $X$ contains $\alpha_1$, $\alpha_3$, $\beta_3$, $\beta_4$. $\alpha_1$, $\beta_3$, $\beta_4$ are valid, so $\mu(X) = \{x\}$.
The subdiagram generated by $X'$ contains $\alpha_2$. $\alpha_2$ is valid, so $\mu(X') = \{x\}$.
In the full diagram, $\alpha_3$, $\beta_1$, $\beta_2$, $\gamma_1$, $\gamma_2$ are valid, so $\mu(Y) = \{y, y'\}$.
□
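
The validity bookkeeping in this solution can be checked mechanically. The following is a minimal sketch, not the authors' own formulation: it assumes one copy per point, treats an arrow as valid in a restriction iff it lies in the restriction and no valid arrow attacks it, and takes the minimal points to be those attacked by no valid arrow. The names `ARROWS`, `in_restriction`, `is_valid` and `mu` are illustrative. The sketch only recomputes the three $\mu$ values, it does not check total smoothness.

```python
# Arrows of the level 3 structure: name -> (origin point, target).
# A target is a point or the name of another arrow
# (a = alpha, b = beta, c = gamma).
ARROWS = {
    "a1": ("x", "y"), "a2": ("x", "y'"), "a3": ("y", "x"),
    "b1": ("y", "a2"), "b2": ("y'", "a1"),
    "b3": ("y", "a3"), "b4": ("x", "a3"),
    "c1": ("y'", "b3"), "c2": ("y'", "b4"),
}


def in_restriction(name, points):
    """An arrow lies in the substructure iff its origin and its target do."""
    origin, target = ARROWS[name]
    if origin not in points:
        return False
    if target in ARROWS:
        return in_restriction(target, points)
    return target in points


def is_valid(name, points):
    """Valid iff present in the restriction and attacked by no valid arrow."""
    if not in_restriction(name, points):
        return False
    return not any(
        target == name and is_valid(attacker, points)
        for attacker, (_, target) in ARROWS.items()
    )


def mu(points):
    """Points of the restriction that no valid arrow attacks."""
    return {
        p for p in points
        if not any(
            target == p and is_valid(attacker, points)
            for attacker, (_, target) in ARROWS.items()
        )
    }


X, X_PRIME, Y = {"x", "y"}, {"x", "y'"}, {"x", "y", "y'"}
print(mu(X), mu(X_PRIME), mu(Y))
```

The recursion terminates because arrows only attack arrows of strictly lower level, so there are no cycles among them.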

Remark 6.3.5 

Example 2.3.1 together with Corollary 6.3.2 show that $(\mu \subseteq)$ and $(\mu CUM)$ without $(\cap)$ do not
guarantee representability by a level $n$ totally smooth structure.

6.4 The essentially smooth case 

Definition 6.4.1 

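Let $\mu : \mathcal{Y} \to \mathcal{P}(U)$ and $\mathcal{X}$ be given, let $\alpha : \langle y, j\rangle \to \langle x, i\rangle \in \mathcal{X}$.
Define

$O(\alpha) := \{Y \in \mathcal{Y} : x \in Y - \mu(Y), y \in \mu(Y)\}$,

$D(\alpha) := \{X \in \mathcal{Y} : x \in \mu(X), y \in X\}$,

$\Pi(O, \alpha) := \Pi\{\mu(Y) : Y \in O(\alpha)\}$,

$\Pi(D, \alpha) := \Pi\{\mu(X) : X \in D(\alpha)\}$.

Diagram 6.3.1: Solution by smooth level 3 structure (see Fact 6.3.4)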
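
As a small illustration, the sets $O(\alpha)$ and $D(\alpha)$ and the choice functions in $\Pi(O, \alpha)$ and $\Pi(D, \alpha)$ can be enumerated for the $\mu$ of Example 6.3.1 and an arrow from $y$ to $x$. This is a sketch under assumptions: $D(\alpha)$ is read as $\{X : x \in \mu(X), y \in X\}$, which is how it is used in the proof of Proposition 6.4.2, copies are ignored, and all function names are hypothetical.

```python
from itertools import chain, combinations, product

U = ("x", "y", "y'")
FULL = frozenset(U)


def powerset(items):
    """All subsets of items, as frozensets."""
    return [frozenset(c) for c in chain.from_iterable(
        combinations(items, r) for r in range(len(items) + 1))]


def mu(s):
    """The choice function of Example 6.3.1."""
    if s == FULL:
        return frozenset({"y", "y'"})
    if s in (frozenset({"x", "y"}), frozenset({"x", "y'"})):
        return frozenset({"x"})
    return s


def O(origin, target):
    """Sets where the target is non-minimal and the origin is minimal."""
    return [Y for Y in powerset(U) if target in Y - mu(Y) and origin in mu(Y)]


def D(origin, target):
    """Sets containing the origin in which the target is minimal."""
    return [X for X in powerset(U) if target in mu(X) and origin in X]


def choice_functions(sets):
    """All functions picking one element of mu(S) for each S in sets."""
    sets = list(sets)
    options = [sorted(mu(s)) for s in sets]
    return [dict(zip(sets, pick)) for pick in product(*options)]


O_alpha = O("y", "x")
D_alpha = D("y", "x")
Pi_O = choice_functions(O_alpha)
Pi_D = choice_functions(D_alpha)
print(len(O_alpha), len(D_alpha), len(Pi_O), len(Pi_D))
```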



Lemma 6.4.1 

Let $U$ be the universe, $\mu : \mathcal{Y} \to \mathcal{P}(U)$. Let $\mu$ satisfy $(\mu \subseteq) + (\mu \subseteq\supseteq)$.

Let $\mathcal{X}$ be a level 1 preferential structure, $\alpha : \langle y, j\rangle \to \langle x, i\rangle$, $O(\alpha) \neq \emptyset$, $D(\alpha) \neq \emptyset$.

We can modify $\mathcal{X}$ to a level 3 structure $\mathcal{X}'$ by introducing level 2 and level 3 arrows s.t. no copy of $\alpha$ is valid in any
$X \in D(\alpha)$, and in every $Y \in O(\alpha)$ at least one copy of $\alpha$ is valid. (More precisely, we should write $\mathcal{X}'\lceil X$ etc.)

Thus, in $\mathcal{X}'$,

(1) $\langle x, i\rangle$ will not be minimal in any $Y \in O(\alpha)$,

(2) if $\alpha$ is the only arrow minimizing $\langle x, i\rangle$ in $X \in D(\alpha)$, $\langle x, i\rangle$ will now be minimal in $X$.
The construction is made independently for all such arrows $\alpha \in \mathcal{X}$.

(This is probably the main technical result of the paper.)



Proof 

(1) The construction 

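Make $\Pi(D, \alpha)$ many copies of $\alpha$: $\{\langle\alpha, f\rangle : f \in \Pi(D, \alpha)\}$, all $\langle\alpha, f\rangle : \langle y, j\rangle \to \langle x, i\rangle$. Note that $\langle\alpha, f\rangle \in X$ for all $X \in D(\alpha)$
and $\langle\alpha, f\rangle \in Y$ for all $Y \in O(\alpha)$.

Add to the structure $\langle\beta, f, X_r, g\rangle : \langle f(X_r), i_r\rangle \to \langle\alpha, f\rangle$, for any $X_r \in D(\alpha)$, and $g \in \Pi(O, \alpha)$ (and some or all $i_r$ - this
does not matter).

For all $Y_s \in O(\alpha)$:

if $\mu(Y_s) \not\subseteq X_r$ and $f(X_r) \in Y_s$, then add to the structure $\langle\gamma, f, X_r, g, Y_s\rangle : \langle g(Y_s), j_s\rangle \to \langle\beta, f, X_r, g\rangle$ (again for all or some
$j_s$),

if $\mu(Y_s) \subseteq X_r$ or $f(X_r) \notin Y_s$, $\langle\gamma, f, X_r, g, Y_s\rangle$ is not added.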
See Diagram 6.4.1.

Diagram 6.4.1: The construction (with $X \in D(\alpha)$ and $Y \in O(\alpha)$)
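
The construction can be tried out on a small instance. The sketch below is an illustration under assumptions, not the authors' code: it builds the copies of $\alpha$, the level 2 arrows and the level 3 arrows for given $D(\alpha)$ and $O(\alpha)$, ignores copies of points, and uses the reading that an arrow is valid in a set iff it lies in the set and no valid arrow attacks it. The instance is the $\mu$ of Example 6.3.1 with an arrow from $y$ to $x$, where $D(\alpha) = \{\{x, y\}\}$ and $O(\alpha) = \{\{x, y, y'\}\}$. All names (`build`, `valid`, the tuple labels) are hypothetical.

```python
from itertools import product


def choice_functions(sets, mu):
    """All functions picking one element of mu[S] for each S in sets."""
    sets = list(sets)
    options = [sorted(mu[s]) for s in sets]
    return [dict(zip(sets, pick)) for pick in product(*options)]


def build(origin, target, D, O, mu):
    """Arrows as name -> (origin point, target point or target arrow name)."""
    arrows = {}
    gs = choice_functions(O, mu)
    for fi, f in enumerate(choice_functions(D, mu)):
        a = ("alpha", fi)                      # one copy of alpha per f
        arrows[a] = (origin, target)
        for X_r in D:
            for gi, g in enumerate(gs):
                b = ("beta", fi, X_r, gi)      # level 2 arrow from f(X_r)
                arrows[b] = (f[X_r], a)
                for Y_s in O:
                    # level 3 arrow only if mu(Y_s) is not a subset of X_r
                    # and f(X_r) lies in Y_s
                    if not mu[Y_s] <= X_r and f[X_r] in Y_s:
                        arrows[("gamma", fi, X_r, gi, Y_s)] = (g[Y_s], b)
    return arrows


def valid(name, arrows, points):
    """Valid iff inside the restriction and attacked by no valid arrow."""
    def inside(n):
        o, t = arrows[n]
        return o in points and (inside(t) if t in arrows else t in points)
    if not inside(name):
        return False
    return not any(t == name and valid(a, arrows, points)
                   for a, (_, t) in arrows.items())


X = frozenset({"x", "y"})
Y = frozenset({"x", "y", "y'"})
mu = {X: frozenset({"x"}), Y: frozenset({"y", "y'"})}
arrows = build("y", "x", [X], [Y], mu)
copies = [n for n in arrows if n[0] == "alpha"]
print(any(valid(c, arrows, X) for c in copies),
      any(valid(c, arrows, Y) for c in copies))
```

In this instance no copy of $\alpha$ is valid in $X$, and one copy is valid in $Y$, which is the behaviour the lemma claims.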

(2) 

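Let $X_r \in D(\alpha)$. We have to show that no $\langle\alpha, f\rangle$ is valid in $X_r$. Fix $f$.

$\langle\alpha, f\rangle$ is in $X_r$, so we have to show that for at least one $g \in \Pi(O, \alpha)$, $\langle\beta, f, X_r, g\rangle$ is valid in $X_r$, i.e. that for this $g$, no
$\langle\gamma, f, X_r, g, Y_s\rangle : \langle g(Y_s), j_s\rangle \to \langle\beta, f, X_r, g\rangle$, $Y_s \in O(\alpha)$ attacks $\langle\beta, f, X_r, g\rangle$ in $X_r$.

We define $g$. Take $Y_s \in O(\alpha)$.

Case 1: $\mu(Y_s) \subseteq X_r$ or $f(X_r) \notin Y_s$: choose arbitrary $g(Y_s) \in \mu(Y_s)$.

Case 2: $\mu(Y_s) \not\subseteq X_r$ and $f(X_r) \in Y_s$: Choose $g(Y_s) \in \mu(Y_s) - X_r$.

In Case 1, $\langle\gamma, f, X_r, g, Y_s\rangle$ does not exist, so it cannot attack $\langle\beta, f, X_r, g\rangle$.

In Case 2, $\langle\gamma, f, X_r, g, Y_s\rangle : \langle g(Y_s), j_s\rangle \to \langle\beta, f, X_r, g\rangle$ is not in $X_r$, as $g(Y_s) \notin X_r$.

Thus, no $\langle\gamma, f, X_r, g, Y_s\rangle : \langle g(Y_s), j_s\rangle \to \langle\beta, f, X_r, g\rangle$, $Y_s \in O(\alpha)$ attacks $\langle\beta, f, X_r, g\rangle$ in $X_r$.

So $\forall \langle\alpha, f\rangle : \langle y, j\rangle \to \langle x, i\rangle$

$y \in X_r \to \exists \langle\beta, f, X_r, g\rangle : \langle f(X_r), i_r\rangle \to \langle\alpha, f\rangle$

$(f(X_r) \in \mu(X_r) \wedge \neg\exists \langle\gamma, f, X_r, g, Y_s\rangle : \langle g(Y_s), j_s\rangle \to \langle\beta, f, X_r, g\rangle . g(Y_s) \in X_r)$.

But $\langle\beta, f, X_r, g\rangle$ was constructed only for $\langle\alpha, f\rangle$, so was $\langle\gamma, f, X_r, g, Y_s\rangle$, and there was no other $\langle\gamma, i\rangle$ attacking $\langle\beta, f, X_r, g\rangle$,
so we are done.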

(3) 

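Let $Y_s \in O(\alpha)$. We have to show that at least one $\langle\alpha, f\rangle$ is valid in $Y_s$.
We define $f \in \Pi(D, \alpha)$. Take $X_r$.

If $\mu(X_r) \not\subseteq Y_s$, choose $f(X_r) \in \mu(X_r) - Y_s$. If $\mu(X_r) \subseteq Y_s$, choose arbitrary $f(X_r) \in \mu(X_r)$.

All attacks on $\langle\alpha, f\rangle$ have the form $\langle\beta, f, X_r, g\rangle : \langle f(X_r), i_r\rangle \to \langle\alpha, f\rangle$, $X_r \in D(\alpha)$, $g \in \Pi(O, \alpha)$. We have to show that
they are either not in $Y_s$, or that they are themselves attacked in $Y_s$.

Case 1: $\mu(X_r) \not\subseteq Y_s$. Then $f(X_r) \notin Y_s$, so $\langle\beta, f, X_r, g\rangle : \langle f(X_r), i_r\rangle \to \langle\alpha, f\rangle$ is not in $Y_s$ (for any $g$).

Case 2: $\mu(X_r) \subseteq Y_s$. Then $\mu(Y_s) \not\subseteq X_r$ by $(\mu \subseteq\supseteq)$ and $f(X_r) \in Y_s$, so $\langle\beta, f, X_r, g\rangle : \langle f(X_r), i_r\rangle \to \langle\alpha, f\rangle$ is in $Y_s$ (for
all $g$). Take any $g \in \Pi(O, \alpha)$. As $\mu(Y_s) \not\subseteq X_r$ and $f(X_r) \in Y_s$, $\langle\gamma, f, X_r, g, Y_s\rangle : \langle g(Y_s), j_s\rangle \to \langle\beta, f, X_r, g\rangle$ is defined, and
$g(Y_s) \in \mu(Y_s)$, so it is in $Y_s$ (for all $g$). Thus, $\langle\beta, f, X_r, g\rangle$ is attacked in $Y_s$.

Thus, for this $f$, all $\langle\beta, f, X_r, g\rangle$ are either not in $Y_s$, or attacked in $Y_s$, thus for this $f$, $\langle\alpha, f\rangle$ is valid in $Y_s$.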
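So for this $\langle x, i\rangle$

$\exists \langle\alpha, f\rangle : \langle y, j\rangle \to \langle x, i\rangle . y \in \mu(Y_s) \wedge$

(a) $\neg\exists \langle\beta, f, X_r, g\rangle : \langle f(X_r), i\rangle \to \langle\alpha, f\rangle . f(X_r) \in Y_s$

or

(b) $\forall \langle\beta, f, X_r, g\rangle : \langle f(X_r), i\rangle \to \langle\alpha, f\rangle$

$(f(X_r) \in Y_s \to \exists \langle\gamma, f, X_r, g, Y_s\rangle : \langle g(Y_s), j_s\rangle \to \langle\beta, f, X_r, g\rangle . g(Y_s) \in \mu(Y_s))$.

As we made copies of $\alpha$ only, introduced only $\beta$'s attacking the $\alpha$-copies, and $\gamma$'s attacking the $\beta$'s, the construction is
independent for different $\alpha$'s.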

□ 



Proposition 6.4.2 

Let $U$ be the universe, $\mu : \mathcal{Y} \to \mathcal{P}(U)$.

Then any $\mu$ satisfying $(\mu \subseteq)$, $(\cap)$, $(\mu CUM)$ (or, alternatively, $(\mu \subseteq)$ and $(\mu \subseteq\supseteq)$) can be represented by a level 3 essentially
smooth structure.

Proof 

In stage one, consider as usual $\mathcal{X} := \langle X, \{\alpha_i : i \in I\}\rangle$ where $X := \{\langle x, f\rangle : x \in U, f \in \Pi\{\mu(X) : X \in \mathcal{Y}, x \in X - \mu(X)\}\}$,
and set $\alpha : \langle x', f'\rangle \to \langle x, f\rangle :\Leftrightarrow x' \in ran(f)$.

For stage two:

Any level 1 arrow $\alpha : \langle y, j\rangle \to \langle x, i\rangle$ was introduced in stage one by some $Y \in \mathcal{Y}$ s.t. $y \in \mu(Y)$, $x \in Y - \mu(Y)$. Do the
construction of Lemma 6.4.1 for all level 1 arrows of $\mathcal{X}$ in parallel or successively.

We have to show that the resulting structure represents $\mu$ and is essentially smooth. (Level 3 is obvious.)

(1) Representation 

Suppose $x \in Y - \mu(Y)$. Then there was in stage 1 for all $\langle x, i\rangle$ some $\alpha : \langle y, j\rangle \to \langle x, i\rangle$, $y \in \mu(Y)$. We examine the $y$.

If there is no $X$ s.t. $x \in \mu(X)$, $y \in X$, then there were no $\beta$'s and $\gamma$'s introduced for this $\alpha : \langle y, j\rangle \to \langle x, i\rangle$, so $\alpha$ is valid.

If there is $X$ s.t. $x \in \mu(X)$, $y \in X$, consider $\alpha : \langle y, j\rangle \to \langle x, i\rangle$. So $X \in D(\alpha)$, $Y \in O(\alpha)$, so we did the construction of
Lemma 6.4.1, and by its result, $\langle x, i\rangle$ is not minimal in $Y$.

Thus, in both cases, $\langle x, i\rangle$ is successfully attacked in $Y$, and no $\langle x, i\rangle$ is a minimal element in $Y$.
Suppose $x \in \mu(X)$ (we change notation to conform to Lemma 6.4.1). Fix $\langle x, i\rangle$.
If there is no $\alpha : \langle y, j\rangle \to \langle x, i\rangle$, $y \in X$, then $\langle x, i\rangle$ is minimal in $X$, and we are done.

If there is $\alpha$ or $\langle\alpha, k\rangle : \langle y, j\rangle \to \langle x, i\rangle$, $y \in X$, then $\alpha$ originated from stage one through some $Y$ s.t. $x \in Y - \mu(Y)$, and
$y \in \mu(Y)$. (Note that stage 2 of the construction did not introduce any new level 1 arrows - only copies of existing level
1 arrows.) So $X \in D(\alpha)$, $Y \in O(\alpha)$, so we did the construction of Lemma 6.4.1, and by its result, $\langle x, i\rangle$ is
minimal in $X$, and we are done again.

In both cases, all $\langle x, i\rangle$ are minimal elements in $X$.

(2) Essential smoothness. We have to show the conditions of Definition 6.1.5. We will, however, work with the
reformulation given in Remark 6.1.2.

Case (1), $x \in \mu(X)$.

Case (1.1), there is $\langle x, i\rangle$ with no $\langle\alpha, f\rangle : \langle y, j\rangle \to \langle x, i\rangle$, $y \in X$. There is nothing to show.
Case (1.2), for all $\langle x, i\rangle$ there is $\langle\alpha, f\rangle : \langle y, j\rangle \to \langle x, i\rangle$, $y \in X$.

$\alpha$ was introduced in stage 1 by some $Y$ s.t. $x \in Y - \mu(Y)$, $y \in X \cap \mu(Y)$, so $X \in D(\alpha)$, $Y \in O(\alpha)$. In the proof of Lemma
6.4.1, at the end of (2), it was shown that

$\exists \langle\beta, f, X_r, g\rangle : \langle f(X_r), i_r\rangle \to \langle\alpha, f\rangle$

$(f(X_r) \in \mu(X_r) \wedge \neg\exists \langle\gamma, f, X_r, g, Y_s\rangle : \langle g(Y_s), j_s\rangle \to \langle\beta, f, X_r, g\rangle . g(Y_s) \in X_r)$.

By $f(X_r) \in \mu(X_r)$, condition (1) of Remark 6.1.2 is true.
Case (2), $x \notin \mu(Y)$. Fix $\langle x, i\rangle$. (We change notation back to $Y$.)
In stage 1, we constructed $\alpha : \langle y, j\rangle \to \langle x, i\rangle$, $y \in \mu(Y)$, so $Y \in O(\alpha)$.
If $D(\alpha) \neq \emptyset$, we did the construction of Lemma 6.4.1, so

$\exists \langle\alpha, f\rangle : \langle y, j\rangle \to \langle x, i\rangle . y \in \mu(Y_s) \wedge$

(a) $\neg\exists \langle\beta, f, X_r, g\rangle : \langle f(X_r), i\rangle \to \langle\alpha, f\rangle . f(X_r) \in Y_s$

or

(b) $\forall \langle\beta, f, X_r, g\rangle : \langle f(X_r), i\rangle \to \langle\alpha, f\rangle$

$(f(X_r) \in Y_s \to \exists \langle\gamma, f, X_r, g, Y_s\rangle : \langle g(Y_s), j_s\rangle \to \langle\beta, f, X_r, g\rangle . g(Y_s) \in \mu(Y_s))$.

As the only attacks on $\langle\alpha, f\rangle$ had the form $\langle\beta, f, X_r, g\rangle$, and $g(Y_s) \in \mu(Y_s)$, condition (2) of Remark 6.1.2 is
satisfied.

□ 



As said after Example 6.3.1, we do not know if level 3 is necessary for representation. We also do not know
whether the same can be achieved with level 3, or higher, totally smooth structures.



6.5 Translation to logic 

We turn to the translation to logics. 
Proposition 6.5.1 

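Let $\mid\sim$ be a logic for $\mathcal{L}$. Set $T^{\mathcal{M}} := Th(\mu_{\mathcal{M}}(M(T)))$, where $\mathcal{M}$ is a generalized preferential structure, and $\mu_{\mathcal{M}}$ its choice
function. Then

(1) there is a level 2 preferential structure $\mathcal{M}$ s.t. $\overline{\overline{T}} = T^{\mathcal{M}}$ iff (LLE), (CCL), (SC) hold for all $T, T' \subseteq \mathcal{L}$.

(2) there is a level 3 essentially smooth preferential structure $\mathcal{M}$ s.t. $\overline{\overline{T}} = T^{\mathcal{M}}$ iff (LLE), (CCL), (SC), $(\subseteq\supseteq)$ hold for all
$T, T' \subseteq \mathcal{L}$.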



Proof 

The proof is an immediate consequence of Corollary 6.2.2 (2), Fact 6.1.3, Proposition 6.4.2,
and Proposition [...] (10) and (11).

(More precisely, for (2): Let M. be an essentially smooth structure, then by Definition 16. 1.71 (page lll3]) for all X /i(A) C X. 
Consider {pCUM). So by Fact El (page [BM (2) /t (X') C X" C X' => /x(X') C A", so by Fact 15X31 fpage [TT21) (1) 
/j(A') = m(A")- (A* CD) is analogous, using Fact 15X51 fpage [TT2")l (3). 
□ 



We leave aside the generalization of preferential structures to attacking structures relative to $\eta$, as this can cause problems,
without giving real insight: It might well be that $\rho(X) \not\subseteq \eta(X)$, but, still, $\rho(X)$ and $\eta(X)$ might define the same theory -
see e.g. Example 4.2.1.



Chapter 7 

Deontic logic and hierarchical conditionals 



7.1 Semantics of deontic logic 
7.1.1 Introductory remarks 

We see some relation of "better" as central for obligations. Obligations determine what is "better" and what is "worse" , 
conversely, given such a relation of "better" , we can define obligations. The problems lie, in our opinion, in the fact that 
an adequate treatment of such a relation is somewhat complicated, and leads to many ramifications. 

On the other hand, obligations have sufficiently many things in common with facts so we can in a useful way say that an 
obligation is satisfied in a situation, and one can also define a notion of derivation for obligations. 

Our approach is almost exclusively semantical. 



7.1.1.1 Context 

The problem with formalisation using logic is that the natural movements in the application area being formalised do not
exactly correspond to natural movements in the logic being used as a tool of formalisation. Put differently, we may be able
to express statement A of the application area by a formula $\phi$ of the logic, but the subtleties of the way A is manipulated
in the application area cannot be matched in the formal logic used. This gives rise to paradoxes. To resolve the paradoxes
one needs to improve the logic. So the progress in the formalisation program depends on the state of development of logic
itself. Recent serious advances in logical tools by the authors of this paper enable us to offer some better formalisation
possibilities for the notion of obligation. This is what we offer in this paper.

Historically, articles on Deontic Logic include collections of problems, see e.g. [MDW94], semantical approaches, see e.g.
[Han69], and others, like [CJ02].

Our basic idea is to see obligations as tightly connected to some relation of "better". An immediate consequence is that
negation, which inverts such a relation, behaves differently in the case of obligations and of classical logic. ("And" and "or"
seem to show analogous behaviour in both logics.) The relation of "better" has to be treated with some caution, however,
and we introduce and investigate local and global properties about "better" of obligations. Most of these properties coincide
in sufficiently nice situations, in others, they are different.

We do not come to a final conclusion which properties obligations should or should not have, perhaps this will be answered 
in future, perhaps there is no universal answer. We provide a list of ideas which seem reasonable to us. 

Throughout, we work in a finite (propositional) setting. 

7.1.1.2 Central idea 

We see a tight connection between obligations and a relation of "morally" better between situations. Obligations are there 
to guide us for "better" actions, and, conversely, given some relation of "better" , we can define obligations. 

The problems lie in the fact that a simple approach via quality is not satisfactory. We examine a number of somewhat 
more subtle ideas, some of them also using a notion of distance. 

7.1.1.3 A common property of facts and obligations 

We are fully aware that an obligation has a conceptually different status than a fact. The latter is true or false, an obligation 
has been set by some instance as a standard for behaviour, or whatever. 

Still, we will say that an obligation holds in a situation, or that a situation satisfies an obligation. If the letter is in the 
mail box, the obligation to post it is satisfied, if it is in the trash bin, the obligation is not satisfied. In some set of worlds, 
the obligation is satisfied, in the complement of this set, it is not satisfied. Thus, obligations behave in this respect like 
facts, and we put for this aspect all distinctions between facts and obligations aside. 

Thus, we will treat obligations most of the time as subsets of the set of all models, but also sometimes as formulas. As we 
work mostly in a finite propositional setting, both are interchangeable. 

We are not concerned here with actions to fulfill obligations, developments or so, just simply situations which satisfy or 
not obligations. 
closed under arbitrary right weakening. This article is only about the latter.



7.1.1.4 Derivations of obligations 

Again, we are aware that "deriving" obligations is different from "deriving" facts. Derivation of facts is supposed to 
conclude from truth to truth, deriving obligations will be about concluding what can also be considered an obligation, 
given some set of "basic" obligations. The parallel is sufficiently strong to justify the double use of the word "derive" . 

Very roughly, we will say that conjunctions (or intersections) and disjunctions (unions) of obligations lead in a reasonable 
way to derived obligations, but negations do not. We take the Ross paradox (see below) very seriously, it was, as a matter 
of fact, our starting point to avoid it in a reasonable notion of derivation. 

We mention two simple postulates derived obligations should probably satisfy. 

(1) Every original obligation should also be a derived obligation, corresponding to $\alpha, \beta \mid\sim \alpha$.

(2) A derived obligation should not be trivial, i.e. neither empty nor U, the universe we work in. 

The last property is not very important from an algebraic point of view, and easily satisfiable, so we will not give it too 
much attention. 



7.1.1.5 Orderings and obligations 

There is, in our opinion, a deep connection between obligations and orderings (and, in addition, distances), which works 
both ways. 

First, given a set of obligations, we can say that one situation is "better" than a second situation, if the first satisfies
"more" obligations than the second does. "More" can be measured by the set of obligations satisfied, and then by the
subset/superset relation, or by the number of obligations. Both are variants of the Hamming distance. "Distance" between
two situations can be measured by the set or number of obligations in which they differ (i.e. one situation satisfies them,
the other not). In both cases, we will call the variants the set or the counting variant.
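
The two variants can be made concrete on a toy universe. The following is a minimal sketch under assumptions: situations are pairs of truth values, the two obligations `post` and `water` and all function names are illustrative, and "better" is read as "at least as good".

```python
from itertools import product

# Situations: (letter posted, flowers watered), each 0 or 1.
SITUATIONS = list(product((0, 1), repeat=2))

# Obligations as sets of situations that satisfy them.
OBLIGATIONS = {
    "post": {s for s in SITUATIONS if s[0] == 1},
    "water": {s for s in SITUATIONS if s[1] == 1},
}


def satisfied(s):
    """Names of the obligations that situation s satisfies."""
    return frozenset(name for name, ob in OBLIGATIONS.items() if s in ob)


def better_set(s, t):
    """Set variant: s satisfies a superset of what t satisfies."""
    return satisfied(s) >= satisfied(t)


def better_count(s, t):
    """Counting variant: s satisfies at least as many obligations as t."""
    return len(satisfied(s)) >= len(satisfied(t))


def distance_set(s, t):
    """Obligations on which s and t differ."""
    return satisfied(s) ^ satisfied(t)


def distance_count(s, t):
    """Number of obligations on which s and t differ."""
    return len(distance_set(s, t))


only_post, only_water, both = (1, 0), (0, 1), (1, 1)
print(better_set(only_post, only_water), better_count(only_post, only_water))
```

The situations `only_post` and `only_water` are incomparable in the set variant but equally good in the counting variant, which is the kind of divergence the text points to.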

This is also the deeper reason why we have to be careful with negation. Negation inverts such orderings: if $\phi$ is better
than $\neg\phi$, then $\neg\phi$ is worse than $\neg\neg\phi = \phi$. But in some reasonable sense $\wedge$ and $\vee$ preserve the ordering, thus they are
compatible with obligations.

Conversely, given a relation (of quality), we might for instance require that obligations are closed under improvement. 
More subtle requirements might work with distances. The relations of quality and distance can be given abstractly (as 
the notion of size used for "soft" obligations), or as above by a starting set of obligations. We will also define important 
auxiliary concepts on such abstract relations. 

7.1.1.6 Derivation revisited 

A set of "basic" obligations generates an ordering and a distance between situations, ordering and distance can be used to 
define properties obligations should have. It is thus natural to define obligations derived from the basic set as those sets 
of situations which satisfy the desirable properties of obligations defined via the order and distance generated by the basic 
set of obligations. Our derivation is thus a two step procedure: first generate the order and distance, which define suitable 
sets of situations. 

We will call properties which are defined without using distances global properties (like closure under improving quality), 
properties involving distance (like being a neighbourhood) local properties. 

7.1.1.7 Relativization 

An important problem is relativization. Suppose $O$ is a set of obligations for all possible situations, e.g. $O$ is the obligation
to post the letter, and $O'$ is the obligation to water the flowers. Ideally, we will do both. Suppose we consider now a subset,
where we cannot do both (e.g. for lack of time). What are our obligations in this subset? Are they just the restrictions to
the subset? Conversely, if $O$ is an obligation for a subset of all situations, is then some $O'$ with $O \subseteq O'$ an obligation for
the set of all situations?

In more complicated requirements, it might be reasonable e.g. to choose the ideal situations still in the big set, even if 
they are not in the subset to be considered, but use an approximation inside the subset. Thus, relativizations present a 
non-trivial problem with many possible solutions, and it seems doubtful whether a universally adequate solution can be 
found. 



7.1.1.8 Numerous possibilities 

Seeing the possibilities presented so far (set or counting order, set or counting distance, various relativizations), we can 
already guess that there are numerous possible reasonable approaches to what an obligation is or should be. Consequently, 
it seems quite impossible to pursue all these combinations in detail. Thus, we concentrate mostly on one combination, and 
leave it to the reader to fill in details for the others, if (s)he is so interested. 

We will also treat the defeasible case here. Perhaps somewhat surprisingly, this is straightforward, and largely due to 
the fact that there is one natural definition of "big" sets for the product set, given that "big" sets are defined for the 
components. So there are no new possibilities to deal with here. 
and misunderstandings on the one side, and loopholes for not quite honest argumentation in practical juridical reasoning
on the other hand. 



7.1.1.9 Notation 

$\mathcal{P}(X)$ will be the powerset of $X$, $A \subseteq B$ will mean that $A$ is a subset of $B$, or $A = B$, $A \subset B$ that $A$ is a proper subset of
$B$.



7.1.1.10 Overview 

We will work in a finite propositional setting, so there is a trivial and 1-1 correspondence between formulas and model 
sets. Thus, we can just work with model sets - which implies, of course, that obligations will be robust under logical 
reformulation. So we will formulate most results only for sets. 

• In Section 7.1.2, we give the basic definitions, together with some simple results about those definitions.

• Section 7.1.3 will present a more philosophical discussion, with more examples, and we will argue that
our definitions are relevant for our purpose.

• As said already, there seems to be a multitude of possible and reasonable definitions of what an obligation can or
should be, so we limit our formal investigation to a few cases, this is given in Section 7.1.4.

• In Section 7.1.5, we give a tentative definition of an obligation.

(The concept of neighbourhood semantics is not new, and was already introduced by D. Scott, [Sco70], and R. Montague,
[Mon70]. Further investigations showed that it was also used by O. Pacheco, [Pac07], precisely to avoid unwanted weakening
for obligations. We came the other way and started with the concept of independent strengthening, see Definition 7.1.13,
and introduced the abstract concept of neighbourhood semantics only at the end. This is one of the reasons
we also have different descriptions which turned out to be equivalent: we came from elsewhere.)



7.1.2 Basic definitions 

We give here all definitions needed for our treatment of obligations. The reader may continue immediately to Section 7.1.3,
and come back to the present section whenever necessary.

Intuitively, $U$ is supposed to be the set of models of some propositional language $\mathcal{L}$, but we will stay purely algebraic
whenever possible. $U' \subseteq U$ is some subset.

For ease of writing, we will here and later sometimes work with propositional variables, and also identify models with the
formula describing them, still this is just shorthand for arbitrary sets and elements. $pq$ will stand for $p \wedge q$, etc.

If a set $\mathcal{O}$ of obligations is given, these will be just arbitrary subsets of the universe $U$. We will also say that $\mathcal{O}$ is over $U$.

Before we deepen the discussion of more conceptual aspects, we give some basic definitions (for which we claim no
originality). We will need them quite early, and think it is better to put them together here, not to be read immediately, but
so the reader can leaf back and find them easily.

We work here with a notion of size (for the defeasible case), a notion $d$ of distance, and a quality relation $\preceq$. The latter
two can be given abstractly, but may also be defined from a set of (basic) obligations.

We use these notions to describe properties obligations should, in our opinion, have. A careful analysis will show later 
interdependencies between the different properties. 



7.1.2.1 Product Size 

For each $U' \subseteq U$ we suppose an abstract notion of size to be given. We may assume this notion to be a filter or an ideal.
Coherence properties between the filters/ideals of different $U'$, $U''$ will be left open, the reader may assume them to be the
conditions of the system P of preferential logic, see Section 2.

Given such notions of size on $U'$ and $U''$, we will also need a notion of size on $U' \times U''$. We take the evident solution:

Definition 7.1.1

Let a notion of "big subset" be defined by a principal filter for all $X \subseteq U$ and all $X' \subseteq U'$. Thus, for all $X \subseteq U$ there
exists a fixed principal filter $\mathcal{F}(X) \subseteq \mathcal{P}(X)$, and likewise for all $X' \subseteq U'$. (This is the situation in the case of preferential
structures, where $\mathcal{F}(X)$ is generated by $\mu(X)$, the set of minimal elements of $X$.)

Define now $\mathcal{F}(X \times X')$ as generated by $\{A \times A' : A \in \mathcal{F}(X), A' \in \mathcal{F}(X')\}$, i.e. if $A$ is the smallest element of $\mathcal{F}(X)$, $A'$
the smallest element of $\mathcal{F}(X')$, then $\mathcal{F}(X \times X') := \{B \subseteq X \times X' : A \times A' \subseteq B\}$.

Fact 7.1.1 

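If $\mathcal{F}(X)$ and $\mathcal{F}(X')$ are generated by preferential structures $\preceq_X$, $\preceq_{X'}$, then $\mathcal{F}(X \times X')$ is generated by the product structure
defined by

$\langle x, x'\rangle \preceq_{X \times X'} \langle y, y'\rangle :\Leftrightarrow x \preceq_X y$ and $x' \preceq_{X'} y'$.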
Proof

We will omit the indices of the orderings when this causes no confusion.

Let $A \in \mathcal{F}(X)$, $A' \in \mathcal{F}(X')$, i.e. $A$ minimizes $X$, $A'$ minimizes $X'$. Let $\langle x, x'\rangle \in X \times X'$, then there are $a \in A$, $a' \in A'$ with
$a \preceq x$, $a' \preceq x'$, so $\langle a, a'\rangle \preceq \langle x, x'\rangle$.

Conversely, suppose $U \subseteq X \times X'$, $U$ minimizes $X \times X'$, but there is no $A \times A' \subseteq U$ s.t. $A \in \mathcal{F}(X)$, $A' \in \mathcal{F}(X')$. Assume
$A = \mu(X)$, $A' = \mu(X')$, so there is $\langle a, a'\rangle \in \mu(X) \times \mu(X')$, $\langle a, a'\rangle \notin U$. But only $\langle a, a'\rangle \preceq \langle a, a'\rangle$, and $U$ does not minimize
$X \times X'$, contradiction.
□ 
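
The statement can be checked on a small example. The sketch below assumes that the product structure is the componentwise order (the proof above uses exactly this step), and it picks two illustrative orders: divisibility on $\{2, 3, 4, 6\}$ and the usual order on $\{1, 2\}$. All names are hypothetical.

```python
from itertools import product


def minimal(elements, leq):
    """Elements with no other element below them."""
    return {x for x in elements
            if not any(leq(y, x) and y != x for y in elements)}


def divides(a, b):
    return b % a == 0


def le(a, b):
    return a <= b


def product_leq(p, q):
    """Componentwise order on pairs (an assumption of this sketch)."""
    return divides(p[0], q[0]) and le(p[1], q[1])


X = {2, 3, 4, 6}
X_PRIME = {1, 2}

mu_X = minimal(X, divides)
mu_X_prime = minimal(X_PRIME, le)
mu_product = minimal(set(product(X, X_PRIME)), product_leq)


def is_big(B):
    """Membership in the principal filter generated by mu_X x mu_X_prime."""
    return set(product(mu_X, mu_X_prime)) <= set(B)


print(mu_product == set(product(mu_X, mu_X_prime)))
```

Here the minimal elements of the product are exactly the pairs of minimal elements, so the filter generated by the product structure coincides with the one of Definition 7.1.1.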

Note that a natural modification of our definition: 

There is $A \in \mathcal{F}(X)$ s.t. for all $a \in A$ there is a (maybe varying) $A'_a \in \mathcal{F}(X')$, and $U := \{\langle a, a'\rangle : a \in A, a' \in A'_a\}$ as
generating sets

will result in the same definition, as our filters are principal, and thus stable under arbitrary intersections. 
7.1.2.2 Distance 

We consider a set of sequences $\Sigma$, for $x \in \Sigma$, $x : I \to S$, $I$ a finite index set, $S$ some set. Often, $S$ will be $\{0, 1\}$, $x(i) = 1$
will mean that $x \in i$, when $I \subseteq \mathcal{P}(U)$ and $x \in U$. For abbreviation, we will call this (unsystematically, often context will
tell) the $\in$-case. Often, $I$ will be written $\mathcal{O}$, intuitively, $O \in \mathcal{O}$ is then an obligation, and $x(O) = 1$ means $x \in O$, or $x$
"satisfies" the obligation $O$.

Definition 7.1.2 

In the $\in$-case, set $\mathcal{O}(x) := \{O \in \mathcal{O} : x \in O\}$.
Definition 7.1.3 

Given $x, y \in \Sigma$, the Hamming distance comes in two flavours:

$d_s(x, y) := \{i \in I : x(i) \neq y(i)\}$, the set variant,

$d_c(x, y) := card(d_s(x, y))$, the counting variant.

We define $d_s(x, y) \leq d_s(x', y')$ iff $d_s(x, y) \subseteq d_s(x', y')$,

thus, $s$-distances are not always comparable.
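
A short sketch of the two flavours, assuming sequences are represented as dictionaries from indices to values (the index names `O1`, `O2`, `O3` and the function names are illustrative):

```python
def d_s(x, y):
    """Set variant: the indices on which the two sequences differ."""
    return frozenset(i for i in x if x[i] != y[i])


def d_c(x, y):
    """Counting variant: the number of such indices."""
    return len(d_s(x, y))


def leq_s(d1, d2):
    """Order on set distances: inclusion."""
    return d1 <= d2


x = {"O1": 1, "O2": 0, "O3": 1}
y = {"O1": 1, "O2": 1, "O3": 0}
z = {"O1": 0, "O2": 0, "O3": 1}

print(sorted(d_s(x, y)), d_c(x, y))
print(leq_s(d_s(x, y), d_s(x, z)), leq_s(d_s(x, z), d_s(x, y)))
```

The distances from `x` to `y` and from `x` to `z` are incomparable as sets, although the counting variant orders them.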

Fact 7.1.2 

(1) In the $\in$-case, we have $d_s(x, y) = \mathcal{O}(x) \triangle \mathcal{O}(y)$, where $\triangle$ is the symmetric set difference.

(2) $d_c$ has the normal addition, set union takes the role of addition for $d_s$, $\emptyset$ takes the role of 0 for $d_s$, both are distances
in the following sense:

(2.1) $d(x, y) = 0$ if $x = y$, but not conversely,

(2.2) $d(x, y) = d(y, x)$,

(2.3) the triangle inequality holds, for the set variant in the form $d_s(x, z) \subseteq d_s(x, y) \cup d_s(y, z)$.
(If $d(x, y) = 0 \not\Rightarrow x = y$ poses a problem, one can always consider equivalence classes.)

Proof 

(2.1) Suppose $U = \{x, y\}$, $\mathcal{O} = \{U\}$, then $\mathcal{O}(x) = \mathcal{O}(y)$, but $x \neq y$.

(2.3) If $i \notin d_s(x, y) \cup d_s(y, z)$, then $x(i) = y(i) = z(i)$, so $x(i) = z(i)$ and $i \notin d_s(x, z)$.

The others are trivial. 

□ 
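
These properties are easy to confirm by brute force on a small universe. The sketch below is only an illustration: the universe, the three obligations `A`, `B`, `C` and the helper names are assumptions made for the example.

```python
from itertools import product

U = ["u1", "u2", "u3"]
OBLIGATIONS = {
    "A": {"u1", "u2"},
    "B": {"u2", "u3"},
    "C": {"u1", "u2", "u3"},
}


def satisfied(x, obligations):
    """O(x): the obligations that x satisfies."""
    return frozenset(n for n, ob in obligations.items() if x in ob)


def d_s(x, y, obligations):
    """Set variant of the Hamming distance in the element-of case."""
    return frozenset(n for n, ob in obligations.items()
                     if (x in ob) != (y in ob))


# (1): the set distance is the symmetric difference of O(x) and O(y).
sym_diff_ok = all(
    d_s(x, y, OBLIGATIONS)
    == satisfied(x, OBLIGATIONS) ^ satisfied(y, OBLIGATIONS)
    for x, y in product(U, repeat=2)
)

# (2.3): triangle inequality with union in the role of addition.
triangle_ok = all(
    d_s(x, z, OBLIGATIONS)
    <= d_s(x, y, OBLIGATIONS) | d_s(y, z, OBLIGATIONS)
    for x, y, z in product(U, repeat=3)
)

# (2.1): distance zero does not force equality when only C is given.
zero_but_different = d_s("u1", "u3", {"C": OBLIGATIONS["C"]}) == frozenset()

print(sym_diff_ok, triangle_ok, zero_but_different)
```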



Definition 7.1.4 

(1) We can define for any distance $d$ with some minimal requirements a notion of "between".
If the codomain of $d$ has an ordering $\leq$, but no addition, we define:

$\langle x, y, z\rangle_d :\Leftrightarrow d(x, y) \leq d(x, z)$ and $d(y, z) \leq d(x, z)$.

If the codomain has a commutative addition, we define

$\langle x, y, z\rangle_d :\Leftrightarrow d(x, z) = d(x, y) + d(y, z)$ - in $d_s$, $+$ will be replaced by $\cup$, i.e.

$\langle x, y, z\rangle_s :\Leftrightarrow d(x, z) = d(x, y) \cup d(y, z)$.

For above two Hamming distances, we will write $\langle x, y, z\rangle_s$ and $\langle x, y, z\rangle_c$.

(2) We further define: 
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We will write [x, z] s and [x, z] c when appropriate. 

(3) For x G U, X C U set x || rf X := {x' G X : -^3x" ^ x' G X.d(x, x') > d(x, x")}. 
Note that, if X 0, then x || X ^ 0. 

We omit the index when this does not cause confusion. Again, when adequate, we write || s and || c . 
For problems with characterizing "between" see |Sch04| . 
Fact 7.1.3 

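(0) $\langle x,y,z\rangle_d \Leftrightarrow \langle z,y,x\rangle_d$.

Consider the situation of a set of sequences $\Sigma$.

Let $A := A_{\sigma,\sigma''} := \{\sigma' : \forall i \in I (\sigma(i) = \sigma''(i) \rightarrow \sigma'(i) = \sigma(i) = \sigma''(i))\}$. Then

(1) If $\sigma' \in A$, then $d_s(\sigma,\sigma'') = d_s(\sigma,\sigma') \cup d_s(\sigma',\sigma'')$, so $\langle \sigma,\sigma',\sigma''\rangle_s$.

(2) If $\sigma' \in A$ and $S$ consists of 2 elements (as in classical 2-valued logic), then $d_s(\sigma,\sigma')$ and $d_s(\sigma',\sigma'')$ are disjoint.

(3) $[\sigma,\sigma'']_s = A$.

(4) If, in addition, $S$ consists of 2 elements, then $[\sigma,\sigma'']_c = A$.

Proof

(0) Trivial.

(1) "$\subseteq$" follows from Fact 7.1.2 (2.3).
Conversely, if e.g. $i \in d_s(\sigma,\sigma')$, then by prerequisite $i \in d_s(\sigma,\sigma'')$.

(2) Let $i \in d_s(\sigma,\sigma') \cap d_s(\sigma',\sigma'')$, then $\sigma(i) \neq \sigma'(i)$ and $\sigma'(i) \neq \sigma''(i)$, but then by $card(S) = 2$ $\sigma(i) = \sigma''(i)$, but $\sigma' \in A$,
contradiction.

We turn to (3) and (4):

If $\sigma' \notin A$, then there is $i'$ s.t. $\sigma(i') = \sigma''(i') \neq \sigma'(i')$. On the other hand, for all $i$ s.t. $\sigma(i) \neq \sigma''(i)$ $i \in d_s(\sigma,\sigma') \cup d_s(\sigma',\sigma'')$.
Thus:

(3) By (1) $\sigma' \in A \Rightarrow \langle \sigma,\sigma',\sigma''\rangle_s$. Suppose $\sigma' \notin A$, so there is $i'$ s.t. $i' \in d_s(\sigma,\sigma') - d_s(\sigma,\sigma'')$, so $\langle \sigma,\sigma',\sigma''\rangle_s$ cannot be.

(4) By (1) and (2) $\sigma' \in A \Rightarrow \langle \sigma,\sigma',\sigma''\rangle_c$. Conversely, if $\sigma' \notin A$, then $card(d_s(\sigma,\sigma')) + card(d_s(\sigma',\sigma'')) \geq card(d_s(\sigma,\sigma'')) + 2$.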
□ 
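
Fact 7.1.3 (3) and (4) can be checked by brute force on a small two-valued domain. The sketch below enumerates all sequences in $\{0,1\}^3$ and compares the set $A$ with the intervals obtained from the two "between" relations; the function names and the choice of three indices are assumptions made for illustration.

```python
from itertools import product

INDEX = range(3)


def d_s(x, y):
    """Set variant of the Hamming distance."""
    return frozenset(i for i in INDEX if x[i] != y[i])


def between_s(x, y, z):
    """<x, y, z>_s: d_s(x, z) is the union of d_s(x, y) and d_s(y, z)."""
    return d_s(x, z) == d_s(x, y) | d_s(y, z)


def between_c(x, y, z):
    """<x, y, z>_c: the counting distances add up."""
    return len(d_s(x, z)) == len(d_s(x, y)) + len(d_s(y, z))


def agree_set(s, s2, universe):
    """The set A: sequences agreeing with s wherever s and s2 agree."""
    return {t for t in universe
            if all(t[i] == s[i] for i in INDEX if s[i] == s2[i])}


universe = list(product((0, 1), repeat=3))

checks = []
for s in universe:
    for s2 in universe:
        A = agree_set(s, s2, universe)
        interval_s = {t for t in universe if between_s(s, t, s2)}
        interval_c = {t for t in universe if between_c(s, t, s2)}
        checks.append(A == interval_s == interval_c)

all_ok = all(checks)
```

With more than two values in $S$ only the set interval is guaranteed to coincide with $A$, which is why (4) carries the extra assumption.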



Definition 7.1.5 

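Given a finite propositional language $\mathcal{L}$ defined by the set $v(\mathcal{L})$ of propositional variables, let $\mathcal{L}_\wedge$ be the set of all consistent
conjunctions of elements from $v(\mathcal{L})$ or their negations. Thus, $p \wedge \neg q \in \mathcal{L}_\wedge$ if $p, q \in v(\mathcal{L})$, but $p \vee q$, $\neg(p \wedge q) \notin \mathcal{L}_\wedge$. Finally,
let $\mathcal{L}_{\vee\wedge}$ be the set of all (finite) disjunctions of formulas from $\mathcal{L}_\wedge$. (As we will later not consider all formulas from $\mathcal{L}_\wedge$, this
will be a real restriction.)

Given a set of models $M$ for a finite language $\mathcal{L}$, define $\phi_M := \bigwedge\{p \in v(\mathcal{L}) : \forall m \in M. m(p) = v\} \wedge \bigwedge\{\neg p : p \in v(\mathcal{L}), \forall m \in
M. m(p) = f\} \in \mathcal{L}_\wedge$. (If there are no such $p$, set $\phi_M := TRUE$.)

This is the strongest $\phi \in \mathcal{L}_\wedge$ which holds in $M$.

Fact 7.1.4

If $x, y$ are models, then $[x,y] = M(\phi_{\{x,y\}})$. □



Proof

$m \in [x,y] \Leftrightarrow \forall p (x \models p, y \models p \Rightarrow m \models p$ and $x \not\models p, y \not\models p \Rightarrow m \not\models p)$, $m \models \phi_{\{x,y\}} \Leftrightarrow m \models \bigwedge\{p : x(p) = y(p) = v\} \wedge \bigwedge\{\neg p :
x(p) = y(p) = f\}$.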

7.1.2.3 Quality and closure 
Definition 7.1.6 

Given any relation $\preceq$ (of quality), we say that $X \subseteq U$ is (downward) closed (with respect to $\preceq$) iff $\forall x \in X \forall y \in U (y \preceq x
\Rightarrow y \in X)$.



(Warning, we follow the preferential tradition, "smaller" will mean "better".)

Fact 7.1.5

Let $\preceq$ be given.

(1) Let $D \subseteq U' \subseteq U''$, $D$ closed in $U''$, then $D$ is also closed in $U'$.

(2) Let $D \subseteq U' \subseteq U''$, $D$ closed in $U'$, $U'$ closed in $U''$, then $D$ is closed in $U''$.

(3) Let $D_i \subseteq U'$ be closed for all $i \in I$, then so are $\bigcup\{D_i : i \in I\}$ and $\bigcap\{D_i : i \in I\}$.

Proof

(1) Trivial.

(2) Let $x \in D \subseteq U'$, $x' \preceq x$, $x' \in U''$, then $x' \in U'$ by closure of $U'$ in $U''$, so $x' \in D$ by closure of $D$ in $U'$.

(3) Trivial. 
□ 

We may have an abstract relation ^ of quality on the domain, but we may also define it from the structure of the sequences, 
as we will do now. 

Definition 7.1.7 

Consider the case of sequences. 

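Given a relation $\preceq$ (of quality) on the codomain, we extend this to sequences in $\Sigma$:

$x \sim y :\Leftrightarrow \forall i \in I (x(i) \sim y(i))$

$x \preceq y :\Leftrightarrow \forall i \in I (x(i) \preceq y(i))$

$x \prec y :\Leftrightarrow \forall i \in I (x(i) \preceq y(i))$ and $\exists i \in I (x(i) \prec y(i))$

In the $\in$-case, we will consider $x \in i$ better than $x \notin i$. As we have only two values, true/false, it is easy to count the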
positive and negative cases (in more complicated situations, we might be able to multiply), so we have an analogue of the 
two Hamming distances, which we might call the Hamming quality relations. 

Let O be given now. 

(Recall that we follow the preferential tradition, "smaller" will mean "better".) 

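$x \sim_s y :\Leftrightarrow \mathcal{O}(x) = \mathcal{O}(y)$,

$x \preceq_s y :\Leftrightarrow \mathcal{O}(y) \subseteq \mathcal{O}(x)$,

$x \prec_s y :\Leftrightarrow \mathcal{O}(y) \subset \mathcal{O}(x)$,

$x \sim_c y :\Leftrightarrow card(\mathcal{O}(x)) = card(\mathcal{O}(y))$,

$x \preceq_c y :\Leftrightarrow card(\mathcal{O}(y)) \leq card(\mathcal{O}(x))$,

$x \prec_c y :\Leftrightarrow card(\mathcal{O}(y)) < card(\mathcal{O}(x))$.

The requirement of closure causes a problem for the counting approach: Given e.g. two obligations $O$, $O'$, then any two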
elements in just one obligation have the same quality, so if one is in, the other should be, too. But this prevents now any of 
the original obligations to have the desirable property of closure. In the counting case, we will obtain a ranked structure, 
where elements satisfy 0, 1, 2, etc. obligations, and we are unable to differentiate inside those layers. Moreover, the set 
variant seems to be closer to logic, where we do not count the propositional variables which hold in a model, but consider 
them individually. For these reasons, we will not pursue the counting approach as systematically as the set approach. One 
should, however, keep in mind that the counting variant gives a ranking relation of quality, as all qualities are comparable, 
and the set variant does not. A ranking seems to be appreciated sometimes in the literature, though we are not really sure 
why. 

Of particular interest is the combination of $d_s$ and $\preceq_s$ ($d_c$ and $\preceq_c$) respectively - where by $\preceq_s$ we also mean $\prec_s$ and $\sim_s$,
etc. We turn to this now.

Fact 7.1.6 

We work in the $\in$-case.

(1) $x \preceq_s y \Rightarrow d_s(x,y) = \mathcal{O}(x) - \mathcal{O}(y)$

Let $a \prec_s b \prec_s c$. Then

(2) $d_s(a,b)$ and $d_s(b,c)$ are not comparable,

(3) $d_s(a,c) = d_s(a,b) \cup d_s(b,c)$, and thus $b \in [a,c]_s$.

This does not hold in the counting variant, as Example 7.1.1 shows.

(4) Let $x \prec_s y$ and $x' \prec_s y$ with $x, x'$ $\prec_s$-incomparable. Then $d_s(x,y)$ and $d_s(x',y)$ are incomparable.
(This does not hold in the counting variant, as then all distances are comparable.)

(5) If $x \prec_s z$, then for all $y \in [x,z]_s$ $x \preceq_s y \preceq_s z$.

Proof

(1) Trivial.

(2) We have $\mathcal{O}(c) \subset \mathcal{O}(b) \subset \mathcal{O}(a)$, so the result follows from (1).

(3) By definition of $d_s$ and (1).

(4) $x$ and $x'$ are $\preceq_s$-incomparable, so there are $O \in \mathcal{O}(x) - \mathcal{O}(x')$, $O' \in \mathcal{O}(x') - \mathcal{O}(x)$.
As $x, x' \prec_s y$, $O, O' \notin \mathcal{O}(y)$, so $O \in d_s(x,y) - d_s(x',y)$, $O' \in d_s(x',y) - d_s(x,y)$.

(5) $x \prec_s z \Rightarrow \mathcal{O}(z) \subset \mathcal{O}(x)$, $d_s(x,z) = \mathcal{O}(x) - \mathcal{O}(z)$. By prerequisite $d_s(x,z) = d_s(x,y) \cup d_s(y,z)$. Suppose $x \not\preceq_s y$. Then
there is $i \in \mathcal{O}(y) - \mathcal{O}(x) \subseteq d_s(x,y)$, so $i \notin \mathcal{O}(x) - \mathcal{O}(z) = d_s(x,z)$, contradiction.

Suppose $y \not\preceq_s z$. Then there is $i \in \mathcal{O}(z) - \mathcal{O}(y) \subseteq d_s(y,z)$, so $i \notin \mathcal{O}(x) - \mathcal{O}(z) = d_s(x,z)$, contradiction.
□ 



Example 7.1.1 

In this and similar examples, we will use the model notation. Some propositional variables $p$, $q$, etc. are given, and models
are described by $p\neg qr$, etc. Moreover, the propositional variables are the obligations, so in this example we have the
obligations $p$, $q$, $r$.

Consider $x := \neg p\neg qr$, $y := pq\neg r$, $z := \neg p\neg q\neg r$. Then $y \prec_c x \prec_c z$, $d_c(x,y) = 3$, $d_c(x,z) = 1$, $d_c(z,y) = 2$, so $x \notin [y,z]_c$. □
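
The numbers in Example 7.1.1 can be recomputed with a few lines of Python. In this sketch a model is identified with the set of obligations it satisfies, so that by Fact 7.1.2 (1) the set distance is the symmetric difference; the variable and function names are illustrative assumptions.

```python
# A model is represented by the set of variables (obligations) it makes true.
x = frozenset({"r"})        # not p, not q, r
y = frozenset({"p", "q"})   # p, q, not r
z = frozenset()             # not p, not q, not r


def d_c(a, b):
    """Counting Hamming distance: size of the symmetric difference."""
    return len(a ^ b)


def better_c(a, b):
    """a is strictly better than b in the counting variant:
    a satisfies more obligations than b."""
    return len(a) > len(b)


chain_holds = better_c(y, x) and better_c(x, z)
distances = (d_c(x, y), d_c(x, z), d_c(z, y))

# x lies between y and z iff the distances add up along y - x - z.
x_between_y_z = d_c(y, z) == d_c(y, x) + d_c(x, z)
```

Here the sum along y, x, z is 4 while the direct distance is 2, so the quality chain does not place x between y and z in the counting variant.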



Definition 7.1.8 

Given a quality relation -< between elements, and a distance d, we extend the quality relation to sets and define: 

(1) $x \prec Y :\Leftrightarrow \forall y \in (x \parallel Y). x \prec y$. (The closest elements - i.e. there are no closer ones - of $Y$, seen from $x$, are less good
than $x$.)

analogously $X \prec y :\Leftrightarrow \forall x \in (y \parallel X). x \prec y$

(2) $X \prec_l Y :\Leftrightarrow \forall x \in X. x \prec Y$ and $\forall y \in Y. X \prec y$ ($X$ is locally better than $Y$).

When necessary, we will write $\prec_{l,s}$ or $\prec_{l,c}$ to distinguish the set from the counting variant.

For the next definition, we use the notion of size: $\nabla\phi$ iff for almost all $\phi$ holds i.e. the set of exceptions is small.

(3) $X \ll_l Y :\Leftrightarrow \nabla x \in X. x \prec Y$ and $\nabla y \in Y. X \prec y$.
We will likewise write $\ll_{l,s}$ etc.

This definition is supposed to capture quality difference under minimal change, the "ceteris paribus" idea: $X \prec_l \mathbf{C}X$
should hold for an obligation $X$. Minimal change is coded by $\parallel$, and "ceteris paribus" by minimal change.

Fact 7.1.7 

If $X \prec_l \mathbf{C}X$, and $x \in U$ an optimal point (there is no better one), then $x \in X$.

Proof

If not, then take $x' \in X$ closest to $x$, this must be better than $x$, contradiction. □



Fact 7.1.8

Take the set version.

If $X \prec_{l,s} \mathbf{C}X$, then $X$ is downward $\preceq_s$-closed.

Proof

Suppose $X \prec_{l,s} \mathbf{C}X$, but $X$ is not downward closed.

Case 1: There are $x \in X$, $y \notin X$, $y \sim_s x$. Then $y \in x \parallel_s \mathbf{C}X$, but $x \not\prec y$, contradiction.

Case 2: There are $x \in X$, $y \notin X$, $y \prec_s x$. By $X \prec_{l,s} \mathbf{C}X$, the elements in $X$ closest to $y$ must be better than $y$. Thus,
there is $x' \prec_s y$, $x' \in X$, with minimal distance from $y$. But then $x' \prec_s y \prec_s x$, so $d_s(x',y)$ and $d_s(y,x)$ are incomparable
by Fact 7.1.6, so $x$ is among those with minimal distance from $y$, so $X \prec_{l,s} \mathbf{C}X$ does not hold. □



Example 7.1.2 

We work with the set variant. 

This example shows that $\preceq_s$-closed does not imply $X \prec_{l,s} \mathbf{C}X$, even if $X$ contains the best elements.

Let $\mathcal{O} := \{p,q,r,s\}$, $U' := \{x := p\neg q\neg r\neg s, y := \neg pq\neg r\neg s, x' := pqrs\}$, $X := \{x, x'\}$. $x'$ is the best element of $U'$,
so $X$ contains the best elements, and $X$ is downward closed in $U'$, as $x$ and $y$ are not comparable. $d_s(x,y) = \{p,q\}$,
$d_s(x',y) = \{p,r,s\}$, so the distances from $y$ are not comparable, so $x$ is among the closest elements in $X$, seen from $y$, but
$x \not\prec y$.

The lack of comparability is essential here, as the following Fact shows.
□



We have, however, for the counting variant: 
Fact 7.1.9 

Consider the counting variant. Then 

If $X$ is downward closed, then $X \prec_{l,c} \mathbf{C}X$.

Proof

Take any $x \in X$, $y \notin X$. We have $y \preceq_c x$ or $x \prec_c y$, as any two elements are $\preceq_c$-comparable. $y \preceq_c x$ contradicts closure,
so $x \prec_c y$, and $X \prec_{l,c} \mathbf{C}X$ holds trivially. □
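
Definition 7.1.8 and Facts 7.1.8 and 7.1.9 can be explored mechanically on small domains. The following sketch implements the closest-element operator, the local quality comparison, and downward closure for both variants. Models are again identified with the sets of obligations they satisfy, all function names are illustrative assumptions, and the four models over p, q, r, s are the ones that reappear in Example 7.1.8.

```python
def d_s(a, b):
    return a ^ b              # set variant, Fact 7.1.2 (1)


def d_c(a, b):
    return len(a ^ b)         # counting variant


def closest(x, X, dist):
    """The elements of X with no strictly closer element of X, seen from x.
    '<' is proper inclusion on frozensets and the usual order on ints."""
    return {a for a in X if not any(dist(x, b) < dist(x, a) for b in X)}


def locally_better(X, Y, dist, better):
    """X is locally better than Y in the sense of Definition 7.1.8 (2)."""
    return (all(better(x, y) for x in X for y in closest(x, Y, dist))
            and all(better(x, y) for y in Y for x in closest(y, X, dist)))


def downward_closed(X, U, at_least_as_good):
    return all(y in X for x in X for y in U if at_least_as_good(y, x))


def better_s(a, b):
    return a > b              # b satisfies a proper subset of a's obligations


def better_c(a, b):
    return len(a) > len(b)


def weak_s(a, b):
    return a >= b


def weak_c(a, b):
    return len(a) >= len(b)


x1 = frozenset("pqrs")   # x'
y1 = frozenset("pqr")    # y'
x0 = frozenset("r")      # x
y0 = frozenset()         # y
U = {x1, y1, x0, y0}
X = {x0, x1}

counting = (locally_better(X, U - X, d_c, better_c),
            downward_closed(X, U, weak_c))
set_variant = (locally_better(X, U - X, d_s, better_s),
               downward_closed(X, U, weak_s))

# A counting-closed set, to compare with Fact 7.1.9.
X2 = {x1, y1}
closed_case = (downward_closed(X2, U, weak_c),
               locally_better(X2, U - X2, d_c, better_c))
```

In the counting variant X is locally better than its complement without being downward closed, whereas in the set variant the local comparison already fails, in line with Fact 7.1.8.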



7.1.2.4 Neighbourhood 

Definition 7.1.9 

Given a distance d, we define: 

(1) Let $X \subseteq Y \subseteq U'$, then $Y$ is a neighbourhood of $X$ in $U'$ iff

$\forall y \in Y \forall x \in X$ ($x$ is closest to $y$ among all $x'$ with $x' \in X$ $\Rightarrow$ $[x,y] \cap U' \subseteq Y$).
(Closest means that there are no closer ones.)
When we also have a quality relation $\prec$, we define:

(2) Let $X \subseteq Y \subseteq U'$, then $Y$ is an improving neighbourhood of $X$ in $U'$ iff

$\forall y \in Y \forall x$ (($x$ is closest to $y$ among all $x'$ with $x' \in X$ and $x' \preceq y$) $\Rightarrow$ $[x,y] \cap U' \subseteq Y$).

When necessary, we will have to say for (3) and (4) which variant, i.e. set or counting, we mean. 

Fact 7.1.10 

(1) If $X \subseteq X' \subseteq \Sigma$, and $d(x,y) = 0 \Rightarrow x = y$, then $X$ and $X'$ are Hamming neighbourhoods of $X$ in $X'$.

(2) If $X \subseteq Y_j \subseteq X' \subseteq \Sigma$ for $j \in J$, and all $Y_j$ are Hamming neighbourhoods of $X$ in $X'$, then so are $\bigcup\{Y_j : j \in J\}$ and
$\bigcap\{Y_j : j \in J\}$.

Proof

(1) is trivial (we need here that $d(x,y) = 0 \Rightarrow x = y$).

(2) Trivial. 
□ 



7.1.2.5 Unions of intersections and other definitions 

Definition 7.1.10 

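Let $\mathcal{O}$ over $U$ be given.

$X \subseteq U'$ is (ui) (for union of intersections) iff there is a family $\mathcal{O}_i \subseteq \mathcal{O}$, $i \in I$ s.t. $X = (\bigcup\{\bigcap \mathcal{O}_i : i \in I\}) \cap U'$.
Unfortunately, this definition is not very useful for simple relativization.

Definition 7.1.11

Let $\mathcal{O}$ be over $U$. Let $\mathcal{O}' \subseteq \mathcal{O}$. Define for $m \in U$ and $\delta : \mathcal{O}' \to 2 = \{0,1\}$

$m \models \delta :\Leftrightarrow \forall O \in \mathcal{O}' (m \in O \Leftrightarrow \delta(O) = 1)$

Definition 7.1.12

Let $\mathcal{O}$ be over $U$.

$\mathcal{O}$ is independent iff $\forall \delta : \mathcal{O} \to 2. \exists m \in U. m \models \delta$.

Definition 7.1.13

This definition is only intended for the set variant.
Let $\mathcal{O}$ be over $U$.

$\mathcal{D}(\mathcal{O}) := \{X \subseteq U' : \forall \mathcal{O}' \subseteq \mathcal{O} \, \forall \delta : \mathcal{O}' \to 2$

$((\exists m, m' \in U, m, m' \models \delta, m \in X, m' \notin X) \Rightarrow (\exists m'' \in X. m'' \models \delta \wedge m'' \prec_s m'))\}$

This property expresses that we can satisfy obligations independently: If we respect $O$, we can, in addition, respect $O'$,
and if we are hopeless kleptomaniacs, we may still not be a murderer. If $X \in \mathcal{D}(\mathcal{O})$, we can go from $U - X$ into $X$ by
improving on all $O \in \mathcal{O}$, which we have not fixed by $\delta$, if $\delta$ is not too rigid.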
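
Definitions 7.1.11 to 7.1.13 can be made concrete with a small brute-force sketch. It assumes one particular reading of Definition 7.1.13, namely that the condition is required for every $m' \notin X$ satisfying $\delta$, as soon as some $m \in X$ satisfies $\delta$. The function names and the two toy domains are illustrative assumptions; models are sets of satisfied obligations.

```python
from itertools import combinations, product


def satisfies(m, delta):
    """m |= delta: m lies in O exactly when delta(O) = 1, for O in dom(delta)."""
    return all((o in m) == (v == 1) for o, v in delta.items())


def assignments(obligations):
    """All delta : O' -> {0, 1}, for all subsets O' of the obligations."""
    for r in range(len(obligations) + 1):
        for sub in combinations(obligations, r):
            for vals in product((0, 1), repeat=r):
                yield dict(zip(sub, vals))


def independent(obligations, U):
    """Every total assignment is satisfied by some model in U."""
    return all(any(satisfies(m, d) for m in U)
               for d in assignments(obligations)
               if len(d) == len(obligations))


def in_D(X, U, obligations):
    """Membership in D(O), under the reading stated in the lead-in."""
    for delta in assignments(obligations):
        sat = [m for m in U if satisfies(m, delta)]
        inside = [m for m in sat if m in X]
        outside = [m for m in sat if m not in X]
        # Every outside model needs a strictly better inside model.
        if inside and any(not any(m2 > m1 for m2 in inside) for m1 in outside):
            return False
    return True


OBLIGATIONS = ("p", "q")
full = [frozenset(s) for s in ("", "p", "q", "pq")]
restricted = [frozenset("p"), frozenset("q")]

results = (
    independent(OBLIGATIONS, full),
    independent(OBLIGATIONS, restricted),
    in_D({frozenset("p"), frozenset("pq")}, full, OBLIGATIONS),
    in_D({frozenset("p")}, restricted, OBLIGATIONS),
)
```

On the full domain the obligation p can always be improved upon without touching what $\delta$ fixes; on the restricted domain, where p and q exclude each other, this is no longer possible.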



7.1.3 Philosophical discussion of obligations 



We take now a closer look at obligations, in particular at the ramifications of the treatment of the relation "better" . Some 
aspects of obligations will also need a notion of distance, we call them local properties of obligations. 

7.1.3.1 A fundamental difference between facts and obligations: asymmetry and negation 

There is an important difference between facts and obligations. A situation which satisfies an obligation is in some sense 
"good", a situation which does not, is in some sense "bad". This is not true of facts. Being "round" is a priori not better 
than "having corners" or vice versa. But given the obligation to post the letter, the letter in the mail box is "good", the
letter in the trash bin is "bad". Consequently, negation has to play a different role for obligations and for facts.

This is a fundamental property, which can also be found in orders, planning (we move towards the goal or not), reasoning
with utility (is $\phi$ or $\neg\phi$ more useful?), and probably others, like perhaps the Black Raven paradox.

We also think that the Ross paradox (see below) is a true paradox, and should be avoided. A closer look shows that this 
paradox involves arbitrary weakening, in particular by the "negation" of an obligation. This was a starting point of our 
analysis. 

"Good" and "bad" cannot mean that any situation satisfying obligation O is better than any situation not satisfying O, 
as the following example shows. 

Example 7.1.3 

If we have three independent and equally strong obligations, O, O' , O" , then a situation satisfying O but neither O' nor 
O" will not be better than one satisfying O' and O" , but not O. 

We have to introduce some kind of "ceteris paribus". All other things being equal, a situation satisfying $O$ is better than
a situation not satisfying $O$, see Section 7.1.3.3.

Example 7.1.4 

The original version of the Ross paradox reads: If we have the obligation to post the letter, then we have the obligation 
to post or burn the letter. Implicit here is the background knowledge that burning the letter implies not to post it, and is 
even worse than not posting it. 

We prefer a modified version, which works with two independent obligations: We have the obligation to post the letter, 
and we have the obligation to water the plants. We conclude by unrestricted weakening that we have the obligation to 
post the letter or not to water the plants. This is obvious nonsense. 

It is not the "or" itself which is the problem. For instance, in case of an accident, to call an ambulance or to help the 
victims by giving first aid is a perfectly reasonable obligation. It is the negation of the obligation to water the plants 
which is the problem. More generally, it must not be that the system of suitable sets is closed under arbitrary supersets, 
otherwise we have closure under arbitrary right weakening, and thus the Ross paradox. Notions like "big subset" or "small 
exception sets" from the semantics of nonmonotonic logics are closed under supersets, so they are not suitable. 



7.1.3.2 "And" and "or" for obligations

"Not" behaves differently for facts and for obligations. If $O$ and $O'$ are obligations, can $O \wedge O'$ be considered an obligation?
We think, yes. "Ceteris paribus", satisfying $O$ and $O'$ together is better than not to do so. If $O$ is the obligation to post the
letter, $O'$ to water the plants, then doing both is good, and better than doing none, or only one. Is $O \vee O'$ an obligation?
Again, we think, yes. Satisfying one (or even both, a non-exclusive or) is better than doing nothing. We might not have
enough time to do both, so we do our best, and water the plants or post the letter. Thus, if $\alpha$ and $\beta$ are obligations, then
so will be $\alpha \wedge \beta$ and $\alpha \vee \beta$, but not anything involving $\neg\alpha$ or $\neg\beta$. (In a non-trivial manner, leaving aside tautologies and
contradictions which have to be considered separately.) To summarize: "and" and "or" preserve the asymmetry, "not" does
not, therefore we can combine obligations using "and" and "or", but not "not". Thus, a reasonable notion of derivation of
obligations will work with $\wedge$ and $\vee$, but not with $\neg$.

We should not close under inverse $\wedge$, i.e. if $\phi \wedge \phi'$ is an obligation, we should not conclude that $\phi$ and $\phi'$ separately are
obligations, as the following example shows. 

Example 7.1.5 

Let $p$ stand for: post letter, $w$: water plants, $s$: strangle grandmother.

Consider now $\phi \wedge \phi'$, where $\phi = p \vee (\neg p \wedge \neg w)$, $\phi' = p \vee (\neg p \wedge w \wedge s)$. $\phi \wedge \phi'$ is equivalent to $p$ - though it is perhaps a bizarre
way to express the obligation to post the letter. $\phi$ leaves us the possibility not to water the plants, and $\phi'$ to strangle the
grandmother, and neither seem good obligations. □
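
The equivalence claimed in Example 7.1.5 can be confirmed with a truth table. The sketch below, with illustrative function names, also lists what each conjunct permits on its own when the letter is not posted.

```python
from itertools import product


def phi(p, w, s):
    """p or (not p and not w)."""
    return p or (not p and not w)


def phi_prime(p, w, s):
    """p or (not p and w and s)."""
    return p or (not p and w and s)


worlds = list(product((False, True), repeat=3))   # triples (p, w, s)

# The conjunction holds exactly in the worlds where p holds.
equivalent_to_p = all((phi(*v) and phi_prime(*v)) == v[0] for v in worlds)

# Worlds without posting the letter that each conjunct still accepts.
phi_allows = [v for v in worlds if phi(*v) and not v[0]]
phi_prime_allows = [v for v in worlds if phi_prime(*v) and not v[0]]
```

Taken separately, the first conjunct accepts worlds where the plants are not watered and the second accepts a world where the grandmother is strangled, which is why splitting the conjunction is not a sound derivation of obligations.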
Remark 7.1.11

This is particularly important in the case of soft obligations, as we see now, when we try to apply the rules of preferential 
reasoning to obligations. 

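One of the rules of preferential reasoning is the (OR) rule:

$\phi \mathrel{|\!\sim} \psi$, $\phi' \mathrel{|\!\sim} \psi$ $\Rightarrow$ $\phi \vee \phi' \mathrel{|\!\sim} \psi$.

Suppose we have $\phi \mathrel{|\!\sim} \psi' \wedge \psi''$, and $\phi' \mathrel{|\!\sim} \psi'$. We might be tempted to split $\psi' \wedge \psi''$ - as $\psi'$ is a "legal" obligation, and argue:
$\phi \mathrel{|\!\sim} \psi' \wedge \psi''$, so $\phi \mathrel{|\!\sim} \psi'$, moreover $\phi' \mathrel{|\!\sim} \psi'$, so $\phi \vee \phi' \mathrel{|\!\sim} \psi'$. The following example shows that this is not always justified.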

Example 7.1.6 

Consider the following obligations for a physician: 

Let $\phi'$ imply that the patient has no heart disease, and if $\phi'$ holds, we should give drug A or (not drug A, but drug B),
abbreviated $A \vee (\neg A \wedge B)$. (B is considered dangerous for people with heart problems.)

Let $\phi$ imply that the patient has heart problems. Here, the obligation is $(A \vee (\neg A \wedge B)) \wedge (A \vee (\neg A \wedge \neg B))$, equivalent to
$A$.

The false conclusion would then be $\phi' \mathrel{|\!\sim} A \vee (\neg A \wedge B)$, and $\phi \mathrel{|\!\sim} A \vee (\neg A \wedge B)$, so $\phi \vee \phi' \mathrel{|\!\sim} A \vee (\neg A \wedge B)$, so in both
situations we should either give A or B, but B is dangerous in "one half" of the situations.

□ 



We captured this idea about "and" and "or" in Definition 7.1.10.

7.1.3.3 Ceteris paribus - a local property

Basically, the set of points "in" an obligation has to be better than the set of "exterior" points. As above Example 7.1.3
with three obligations shows, demanding that any element inside is better than any element outside, is too
strong. We use instead the "ceteris paribus" idea.

"All other things being equal" seems to play a crucial role in understanding obligations. Before we try to analyse it, we 
look for other concepts which have something to do with it. 

The Stalnaker/Lewis semantics for counterfactual conditionals also works with some kind of "ceteris paribus". "If it were
to rain, I would use an umbrella" means something like: "If it were to rain, and there were not a very strong wind" (there
is no such wind now), "if I had an umbrella" (I have one now), etc., i.e. if things were mostly as they are now, with the
exception that now it does not rain, and in the situation I speak about it rains, then I will use an umbrella.

But also theory revision in the AGM sense contains - at least as objective - this idea: Change things as little as possible 
to incorporate some new information in a consistent way. 

When looking at the "ceteris paribus" in obligations, a natural interpretation is to read it as "all other obligations being 
unchanged" (i.e. satisfied or not as before). This is then just a Hamming distance considering the obligations (but not 
other information). 

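Then, in particular, if $\mathcal{O}$ is a family of obligations, and if $x$ and $x'$ are in the same subset $\mathcal{O}' \subseteq \mathcal{O}$ of obligations, then an
obligation derived from $\mathcal{O}$ should not separate them. More precisely, if $x \in O \in \mathcal{O} \Leftrightarrow x' \in O \in \mathcal{O}$, and $D$ is a derived
obligation, then $x \in D \Leftrightarrow x' \in D$.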

Example 7.1.7 

If the only obligation is not to kill, then it should not be derivable not to kill and to eat spaghetti. 

Often, this is impossible, as obligations are not independent. In this case, but also in other situations, we can push "ceteris 
paribus" into an abstract distance d (as in the Stalnaker/Lewis semantics), which we postulate as given, and say that 
satisfying an obligation makes things better when going from "outside" the obligation to the d— closest situation "inside" . 
Conversely, whatever the analysis of "ceteris paribus", and given a quality order on the situations, we can now define an 
obligation as a formula which (perhaps among other criteria) "ceteris paribus" improves the situation when we go from 
"outside" the formula "inside" . 

A simpler way to capture "ceteris paribus" is to connect it directly to obligations, see Definition 7.1.13. This
is probably too much tied to independence (see below), and thus too rigid.

7.1.3.4 Hamming neighbourhoods 

A combination concept is a Hamming neighbourhood: 

X is called a Hamming neighbourhood of the best cases iff for any x G X and y a best case with minimal distance from 
x, all elements between x and y are in X. 

For this, we need a notion of distance (also to define "between"). This was made precise in Definition 7.1.3
and Definition 7.1.9.



7.1.3.5 Global and mixed global/local properties of obligations 



(1) Downward closure

Consider the following example:

Example 7.1.8

Let $U' := \{x, x', y, y'\}$ with $x' := pqrs$, $y' := pqr\neg s$, $x := \neg p\neg qr\neg s$, $y := \neg p\neg q\neg r\neg s$.
Consider $X := \{x, x'\}$.

The counting version:

Then $x'$ has quality 4 (the best), $y'$ has quality 3, $x$ has 1, $y$ has 0.
$d_c(x',y') = 1$, $d_c(x,y) = 1$, $d_c(x,y') = 2$.

Then above "ceteris paribus" criterion is satisfied, as $y'$ and $x$ do not "see" each other, so $X \prec_{l,c} \mathbf{C}X$.
But $X$ is not downward closed, below $x \in X$ is a better element $y' \notin X$.
This seems an argument against $X$ being an obligation.

The set version:

We still have $x' \prec_s y' \prec_s x \prec_s y$. As shown in Fact 7.1.6, $d_s(x,y)$ (and also $d_s(x',y')$) and $d_s(x,y')$ are
not comparable, so our argument collapses.

As a matter of fact, we have the result that the "ceteris paribus" criterion entails downward closure in the set variant,
see Fact 7.1.8.

□ 



Note that a sufficiently rich domain (put elements between $y'$ and $x$) will make this local condition (for $\prec$) a global
one, so we have here a domain problem. Domain problems are discussed e.g. in [Sch04] and [GS08a].

(2) Best states 

It seems also reasonable to postulate that obligations contain all best states. In particular, obligations have then to 
be consistent - under the condition that best states exist. We are aware that this point can be debated, there is, of 
course, an easy technical way out: we take, when necessary, unions of obligations to cover the set of ideal cases. So 
obligations will be certain "neighbourhoods" of the "best" situations. 

We think, that some such notion of neighbourhood is a good candidate for a semantics: 

• A system of neighbourhoods is not necessarily closed under supersets. 

• Obligations express something like an approximation to the ideal case where all obligations (if possible, or, as 
many as possible) are satisfied, so we try to be close to the ideal. If we satisfy an obligation, we are (relatively) 
close, and stay so as long as the obligation is satisfied. 

• The notion of neighbourhood expresses the idea of being close, and containing everything which is sufficiently 
close. Behind "containing everything which is sufficiently close" is the idea of being in some sense convex. Thus,
"convex" or "between" is another basic notion to be investigated. See here also the discussion of "between" in
[Sch04].



7.1.3.6 Soft obligations 

"Soft" obligations are obligations which have exceptions. Normally, one is obliged to do O, but there are cases where one 
is not obliged. This is like soft rules, as "Birds fly" (but penguins do not), where exceptions are not explicitly mentioned. 

The semantic notions of size are very useful here, too. We will content ourselves that soft obligations satisfy the postulates 
of usual obligations everywhere except on a small set of cases. For instance, a soft obligation $O$ should be downward closed
"almost" everywhere, i.e. for a small subset of pairs $\langle a, b\rangle$ in $U \times U$ we accept that $a \prec b$, $b \in O$, $a \notin O$. We transplanted
a suitable and cautious notion of size from the components to the product in Definition 7.1.1.

When we look at the requirement to contain the best cases, we might have to soften this, too. We will admit that a small 
set of the ideal cases might be excluded. Small can be relative to all cases, or only to all ideal cases. 

Soft obligations generate an ordering which takes care of exceptions, like the normality ordering of birds will take care of 
penguins: within the set of penguins, non-flying animals are the normal ones. Based on this ordering, we define "derived
soft obligations" , they may have (a small set of) exceptions with respect to this ordering. 



7.1.3.7 Overview of different types of obligations 

(1) Hard obligations. They hold without exceptions, as in the Ten Commandments. You should not kill. 

(1.1) In the simplest case, they apply everywhere and can be combined arbitrarily, i.e. for any $\mathcal{O}' \subseteq \mathcal{O}$ there is a
model where all $O \in \mathcal{O}'$ hold, and no $O' \in \mathcal{O} - \mathcal{O}'$.

(1.2) In a more complicated case, not all combinations are possible. This is the same as considering just an arbitrary
subset of $U$ with the same set $\mathcal{O}$ of obligations. This case is very similar to the case of conditional obligations

Example 7.1.9

Normally, one should not offer a cigarette to someone, out of respect for his health. But the considerate assassin 
might do so nonetheless, on the cynical reasoning that the victim's health is going to suffer anyway: 

(1) One should not kill, $\neg k$.

(2) One should not offer cigarettes, $\neg o$.

(3) The assassin should offer his victim a cigarette before killing him, if $k$, then $o$.

Here, globally, $\neg k$ and $\neg o$ is best, but among $k$-worlds, $o$ is better than $\neg o$. The model ranking is $\neg k \wedge \neg o \prec
\neg k \wedge o \prec k \wedge o \prec k \wedge \neg o$.

Recall that an obligation for the whole set need not be an obligation for a subset any more, as it need not contain 
all best states. In this case, we may have to take a union with other obligations. 

(2) Soft obligations. 

Many obligations have exceptions. Consider the following example: 

Example 7.1.10 

You are in a library. Of course, you should not pour water on a book. But if the book has caught fire, you should pour 
water on it to prevent worse damage. In stenographic style these obligations read: "Do not pour water on books" . 
"If a book is on fire, do pour water on it." It is like "birds fly", but "penguins do not fly", "soft" or nonmonotonic 
obligations, which have exceptions, which are not formulated in the original obligation, but added as exceptions. 

We could have formulated the library obligation also without exceptions: "When you are in a library, and the book 
is not on fire, do not pour water on it." "When you are in a library, and the book is on fire, pour water on it." This 
formulation avoids exceptions. Conditional obligations behave like restricted quantifiers: they apply in a subset of 
all possible cases. 

We treat now the considerate assassin case as an obligation (not to offer) with exceptions. Consider the full set $U$,
and consider the obligation $\neg o$. This is not downward closed, as $k \wedge o$ is better than $k \wedge \neg o$. Downward closure will
only hold for "most" cases, but not for all.

(3) Contrary-to-duty obligations. 

Contrary-to-duty obligations are about different degrees of fulfillment. If you should ideally not have any fence, but 
are not willing or able to fulfill this obligation (e.g. you have a dog which might stray), then you should at least 
paint it white to make it less conspicuous. This is also a conditional obligation. Conditional, as it specifies what has 
to be done if there is a fence. The new aspect in contrary-to-duty obligations is the different degree of fulfillment. 

We will not treat contrary-to-duty obligations here, as they do not seem to have any import on our basic ideas and 
solutions. 

(4) A still more complicated case is when the language of obligations is not uniform, i.e. there are subsets $V \subseteq U$ where
obligations are defined, which are not defined in $U - V$.

We will not pursue this case here. 
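
The considerate assassin of Example 7.1.9, read in item (2) above as a soft obligation not to offer cigarettes, can be sketched as follows. The numeric ranks merely encode the model ranking given in the example, and the names RANK, not_o and exceptions are illustrative assumptions.

```python
# Worlds are pairs (k, o); rank 0 is best ("smaller" means "better").
RANK = {
    (False, False): 0,   # not k, not o
    (False, True): 1,    # not k, o
    (True, True): 2,     # k, o
    (True, False): 3,    # k, not o
}
U = list(RANK)


def better(a, b):
    return RANK[a] < RANK[b]


not_o = {w for w in U if not w[1]}   # the obligation "do not offer"

# Pairs (a, b) violating downward closure:
# a is better than b, b satisfies the obligation, a does not.
exceptions = [(a, b) for a in U for b in U
              if better(a, b) and b in not_o and a not in not_o]

downward_closed = not exceptions
best_among_k = min((w for w in U if w[0]), key=RANK.get)
```

Both violating pairs have the world k, not o as their second component, so downward closure of the obligation fails only below the killing worlds and holds everywhere else.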



7.1.3.8 Summary of the philosophical remarks 

(1) It seems justifiable to say that an obligation is satisfied or holds in a certain situation. 

(2) Obligations are fundamentally asymmetrical, thus negation has to be treated with care. "Or" and "and" behave as 
for facts. 

(3) Satisfying obligations improves the situation with respect to some given grading - ceteris paribus. 

(4) "Ceteris paribus" can be defined by minimal change with respect to other obligations, or by an abstract distance. 

(5) Conversely, given a grading and some distance, we can define an obligation locally as describing an improvement 
with respect to this grading when going from "outside" to the closest point "inside" the obligation. 

(6) Obligations should also have global properties: they should be downward (i.e. under increasing quality) closed, and 
cover the set of ideal cases. 

(7) The properties of "soft" obligations, i.e. with exceptions, have to be modified appropriately. Soft obligations generate 
an ordering, which in turn may generate other obligations, where exceptions to the ordering are permitted. 

(8) Quality and distance can be defined from an existing set of obligations in the set or the counting variant. Their 
behaviour is quite different. 

(9) We distinguished various cases of obligations, soft and hard, with and without all possibilities, etc. 

Finally, we should emphasize that the notions of distance, quality, and size are in principle independent, even if they may 
be based on a common substructure. 



7.1.4 Examination of the various cases

7.1.4.1 Hard obligations for the set approach

7.1.4.1.1 Introduction We work here in the set version, the $\in$-case, and examine mostly the set version only.

We will assume a set $\mathcal{O}$ of obligations to be given. We define the relation $\prec := \prec_{\mathcal{O}}$ as described in Definition 7.1.3,
and the distance $d$ is the Hamming distance based on $\mathcal{O}$.

7.1.4.1.2 The not necessarily independent case 
Example 7.1.11 

Work in the set variant. We show that $X$ $\preceq_s$-closed does not necessarily imply that $X$ contains all $\preceq_s$-best elements.

Let $\mathcal{O} := \{p, q\}$, $U' := \{p\neg q, \neg pq\}$, then all elements of $U'$ have best quality in $U'$, $X := \{p\neg q\}$ is closed, but does not
contain all best elements. □



Example 7.1.12 

Work in the set variant. We show that $X$ $\preceq_s$-closed does not necessarily imply that $X$ is a neighbourhood of the best
elements, even if $X$ contains them.

Consider $x := pq\neg rstu$, $x' := \neg pqrs\neg t\neg u$, $x'' := p\neg qr\neg s\neg t\neg u$, $y := p\neg q\neg r\neg s\neg t\neg u$, $z := pq\neg r\neg s\neg t\neg u$. $U := \{x, x', x'', y, z\}$,
the $\prec_s$-best elements are $x, x', x''$, they are contained in $X := \{x, x', x'', z\}$. $d_s(z,x) = \{s,t,u\}$, $d_s(z,x') = \{p,r,s\}$,
$d_s(z,x'') = \{q,r\}$, so $x''$ is one of the best elements closest to $z$. $d(z,y) = \{q\}$, $d(y,x'') = \{r\}$, so $[z,x''] = \{z,y,x''\}$, $y \notin X$,
but $X$ is downward closed. □
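
Example 7.1.12 can be replayed in a few lines. The sketch below recomputes the best elements, checks downward closure, and tests the neighbourhood and improving-neighbourhood conditions of Definition 7.1.9 in the set variant; function names are illustrative assumptions and models are sets of satisfied obligations.

```python
def d_s(a, b):
    return a ^ b   # set variant of the Hamming distance


def interval(a, b, U):
    """The elements of U between a and b in the set variant."""
    return {m for m in U if d_s(a, b) == d_s(a, m) | d_s(m, b)}


def closest(y, X):
    """Elements of X with no strictly closer element of X, seen from y."""
    return {a for a in X if not any(d_s(y, b) < d_s(y, a) for b in X)}


x = frozenset("pqstu")
x1 = frozenset("qrs")    # x'
x2 = frozenset("pr")     # x''
y = frozenset("p")
z = frozenset("pq")
U = {x, x1, x2, y, z}
X = {x, x1, x2, z}

# Best elements: no other model satisfies a proper superset of obligations.
best = {m for m in U if not any(n > m for n in U)}

# Downward closure: everything at least as good as a member is a member.
closed = all(n in X for m in X for n in U if n >= m)

# Neighbourhood: intervals to all closest best elements stay inside X.
neighbourhood = all(interval(b, m, U) <= X
                    for m in X for b in closest(m, best))

# Improving neighbourhood: only best elements at least as good as m count.
improving = all(interval(b, m, U) <= X
                for m in X
                for b in closest(m, {c for c in best if c >= m}))
```

The neighbourhood test fails because y lies between z and x'' but outside X, while the improving variant only looks from z towards x and therefore succeeds, matching Fact 7.1.12 (5) and (6).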



Fact 7.1.12 

Work in the set variant. 

Let X ^ 0, X ^ s -closed. Then 

(1) X does not necessarily contain all best elements. 

Assume now that X contains, in addition, all best elements. Then 

(2) X ~<i yS CX does not necessarily hold. 

(3) X is (ui). 

(4) X G T>{0) does not necessarily hold. 

(5) X is not necessarily a neighbourhood of the best elements. 

(6) X is an improving neighbourhood of the best elements. 

Proof 

(1) See Example EXm (page H35]) 

(2) See Example EU (page [129]) 

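(3) If there is $m \in X$, $m \notin O$ for all $O \in \mathcal{O}$, then by closure $X = U$, take $\mathcal{O}_i := \emptyset$.
For $m \in X$ let $\mathcal{O}_m := \{O \in \mathcal{O} : m \in O\}$. Let $X' := \bigcup\{\bigcap \mathcal{O}_m : m \in X\}$.

$X \subseteq X'$: trivial, as $m \in X \rightarrow m \in \bigcap \mathcal{O}_m \subseteq X'$.

$X' \subseteq X$: Let $m' \in \bigcap \mathcal{O}_m$ for some $m \in X$. It suffices to show that $m' \preceq_s m$. $m' \in \bigcap \mathcal{O}_m = \bigcap\{O \in \mathcal{O} : m \in O\}$, so for all $O \in \mathcal{O}$ $(m \in O \rightarrow m' \in O)$.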

(4) Consider Example 7.1.2, let $dom(\delta) = \{r, s\}$, $\delta(r) = \delta(s) = 0$. Then $x, y \models \delta$, but $x' \not\models \delta$ and $x \in X$, $y \notin X$, but there is no $z \in X$, $z \models \delta$ and $z \prec y$, so $X \notin \mathcal{D}(\mathcal{O})$.

(5) See Example [7XT3 (page [SSI). 

(6) By Fact ITXol f page H25I). (5). 
□ 



Fact 7.1.13 

Work in the set variant 

(1.1) $X \prec_{l,s} \mathbf{C}X$ implies that $X$ is $\preceq_s$-closed.

(1.2) $X \prec_{l,s} \mathbf{C}X$ $\Rightarrow$ $X$ contains all best elements

(2.1) $X$ is (ui) $\Rightarrow$ $X$ is $\preceq_s$-closed.

(2.2) $X$ is (ui) does not necessarily imply that $X$ contains all $\preceq_s$-best elements.

(3.1) $X \in \mathcal{D}(\mathcal{O})$ implies that $X$ is $\preceq_s$-closed.

(3.2) $X \in \mathcal{D}(\mathcal{O})$ implies that $X$ contains all $\preceq_s$-best elements.

(4.1) $X$ is an improving neighbourhood of the $\preceq_s$-best elements $\Rightarrow$ $X$ is $\preceq_s$-closed.

(4.2) $X$ is an improving neighbourhood of the best elements $\Rightarrow$ $X$ contains all best elements.

Proof 

(1.1) By Fact EXU (page [Ml)- 

(1.2) By Fact [7X71 fpage fT29)) . 

(2.1) Let $O \in \mathcal{O}$, then $O$ is downward closed (no $y \notin O$ can be better than $x \in O$). The rest follows from Fact 17.1. "51 (3).

(2.2) Consider Example 7.1.11 (page 135), $p$ is (ui) (formed in $U'$), but $p \cap X$ does not contain $\neg p q$.

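(3.1) Let $X \in \mathcal{D}(\mathcal{O})$, but let $X$ not be closed. Thus, there are $m \in X$, $m' \preceq_s m$, $m' \notin X$.

Case 1: Suppose $m' \sim m$. Let $\delta_m : \mathcal{O} \rightarrow 2$, $\delta_m(O) = 1$ iff $m \in O$. Then $m, m' \models \delta_m$, and there cannot be any $m'' \models \delta_m$, $m'' \prec_s m'$, so $X \notin \mathcal{D}(\mathcal{O})$.

Case 2: $m' \prec_s m$. Let $\mathcal{O}' := \{O \in \mathcal{O} : m \in O \Leftrightarrow m' \in O\}$, $dom(\delta) = \mathcal{O}'$, $\delta(O) := 1$ iff $m \in O$ for $O \in \mathcal{O}'$. Then $m, m' \models \delta$.
If there is $O \in \mathcal{O}$ s.t. $m' \notin O$, then by $m' \preceq_s m$ also $m \notin O$, so $O \in \mathcal{O}'$. Thus for all $O \notin dom(\delta)$, $m' \in O$. But then there is no $m'' \models \delta$, $m'' \prec_s m'$, as $m'$ is already optimal among the $n$ with $n \models \delta$.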

(3.2) Suppose $X \in \mathcal{D}(\mathcal{O})$, $x' \in U - X$ is a best element, take $\delta := \emptyset$, $x \in X$. Then there must be $x'' \prec x'$, $x'' \in X$, but this is impossible as $x'$ was best.

(4.1) By Fact 17.1.1)1 (page 128) (4), all minimal elements have incomparable distance. But if $z \preceq y$, $y \in X$, then either $z$ is minimal or it is above a minimal element, with minimal distance from $y$, so $z \in X$ by Fact I7.1TB1 (page 125) (3).

(4.2) Trivial. 
□ 



7.1.4.1.3 The independent case

Assume now the system to be independent, i.e. all combinations of $\mathcal{O}$ are present.

Note that there is now only one minimal element, and the notions of Hamming neighbourhood of the best elements and 
improving Hamming neighbourhood of the best elements coincide. 

Fact 7.1.14 

Work in the set variant. 

Let $X \neq \emptyset$, $X$ $\preceq_s$-closed. Then

(1) X contains the best element. 

(2) $X \prec_{l,s} \mathbf{C}X$

(3) X is (ui). 

(4) $X \in \mathcal{D}(\mathcal{O})$

(5) X is a (improving) Hamming neighbourhood of the best elements. 

Proof 

(1) Trivial. 

(2) Fix $x \in X$, let $y$ be closest to $x$, $y \notin X$. Suppose $x \not\prec y$, then there must be $O \in \mathcal{O}$ s.t. $y \in O$, $x \notin O$. Choose $y'$ s.t. $y'$ is like $y$, only $y' \notin O$. If $y' \in X$, then by closure $y \in X$, so $y' \notin X$. But $y'$ is closer to $x$ than $y$ is, contradiction.

Fix $y \in U - X$. Let $x$ be closest to $y$, $x \in X$. Suppose $x \not\prec y$, then there is $O \in \mathcal{O}$ s.t. $y \in O$, $x \notin O$. Choose $x'$ s.t. $x'$ is like $x$, only $x' \in O$. By closure of $X$, $x' \in X$, but $x'$ is closer to $y$ than $x$ is, contradiction.

(3) By Fact EXTJ (page [1311) (3) 

(4) Let $X$ be closed, and $\mathcal{O}' \subseteq \mathcal{O}$, $\delta : \mathcal{O}' \rightarrow 2$, $m, m' \models \delta$, $m \in X$, $m' \notin X$. Let $m''$ be s.t. $m'' \models \delta$, and for all $O \in \mathcal{O} - dom(\delta)$ $m'' \in O$. This exists by independence. Then $m'' \preceq_s m'$, but also $m'' \preceq_s m$, so $m'' \in X$. Suppose $m'' \sim m'$, then $m' \preceq_s m''$, so $m' \in X$, contradiction, so $m'' \prec_s m'$.

(5) Trivial by (1), the remark preceding this Fact, and Fact 7.1.12 (page 135) (6).

Fact 7.1.15 

Work in the set variant. 

(1) $X \prec_{l,s} \mathbf{C}X$ $\Rightarrow$ $X$ is $\preceq_s$-closed,

(2) $X$ is (ui) $\Rightarrow$ $X$ is $\preceq_s$-closed,

(3) $X \in \mathcal{D}(\mathcal{O})$ $\Rightarrow$ $X$ is $\preceq_s$-closed,
Proof

(1) Suppose there are $x \in X$, $y \in U - X$, $y \prec x$. Choose them with minimal distance. If $card(d_s(x, y)) > 1$, then there is $z$, $y \prec_s z \prec_s x$, $z \in X$ or $z \in U - X$, contradicting minimality. So $card(d_s(x, y)) = 1$. So $y$ is among the closest elements of $U - X$ seen from $x$, but then by prerequisite $x \prec y$, contradiction.

(2) By Fact 7.1.13 (2.1).

(3) By Fact 7.1.13 (3.1).

(4) There is just one best element $z$, so if $x \in X$, then $[x, z]$ contains all $y$ with $y \prec x$ by Fact 17.1. "51 (page 128) (3).
□ 

The $\mathcal{D}(\mathcal{O})$ condition seems to be adequate only for the independent situation, so we stop considering it now.

Fact 7.1.16

Let $X_i \subseteq U$, $i \in I$, be a family of sets. We note the following about closure under unions and intersections:

(1) If the Xi are downward closed, then so are their unions and intersections. 

(2) If the Xi are (ui), then so are their unions and intersections. 

Proof 

Trivial. □ 

We do not know whether $\prec_{l,s}$ is preserved under unions and intersections; it does not seem an easy problem.
Fact 7.1.17 

(1) Being downward closed is preserved while going to subsets. 

(2) Containing the best elements is not preserved (and thus neither the neighbourhood property). 

(3) The $\mathcal{D}(\mathcal{O})$ property is not preserved.

(4) $\prec_{l,s}$ is not preserved.

Proof 

(4) Consider Example 7.1.8 (page 133), and eliminate $y$ from $U'$, then the closest to $x$ not in $X$ is $y'$, which is better.
□ 

7.1.4.2 Remarks on the counting case 
Remark 7.1.18 

In the counting variant all qualities are comparable. So if $X$ is closed, it will contain all minimal elements.

Example 7.1.13 

We measure distance by counting. 

Consider $a := \neg p\neg q\neg r\neg s$, $b := \neg p\neg q\neg rs$, $c := \neg p\neg qr\neg s$, $d := pqr\neg s$, let $U := \{a, b, c, d\}$, $X := \{a, c, d\}$. $d$ is the best element, $[a, d] = \{a, d, c\}$, so $X$ is an improving Hamming neighbourhood, but $b \prec a$, so $X \not\prec_{l,c} \mathbf{C}X$.

□
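The counting example can be replayed in a few lines. This is a hedged sketch, assuming that the obligations are the four variables $p, q, r, s$, that quality is the number of satisfied obligations, that distance is the number of differing variables, and that $[a, d]$ consists of the $w$ with $d(a, w) + d(w, d) = d(a, d)$; the helper names are hypothetical.

```python
def model(s):
    """Parse e.g. '-p-q-rs' ('-' negates the next variable) into the set of true variables."""
    true, neg = set(), False
    for ch in s:
        if ch == "-":
            neg = True
        else:
            if not neg:
                true.add(ch)
            neg = False
    return frozenset(true)

U = {
    "a": model("-p-q-r-s"),
    "b": model("-p-q-rs"),
    "c": model("-p-qr-s"),
    "d": model("pqr-s"),
}

def quality(w):
    """Counting variant: number of satisfied obligations (more is better)."""
    return len(U[w])

def dist(v, w):
    """Counting Hamming distance: number of variables on which v and w differ."""
    return len(U[v] ^ U[w])

def between(v, w):
    """Assumed counting variant of the interval [v, w]."""
    return {u for u in U if dist(v, u) + dist(u, w) == dist(v, w)}

X = {"a", "c", "d"}
best_element = max(U, key=quality)
outside = [w for w in U if w not in X]
closest_outside_a = min(outside, key=lambda w: dist("a", w))

print(best_element, between("a", "d"), closest_outside_a,
      quality(closest_outside_a) > quality("a"))
```

The last printed value reports whether the closest element of the complement is strictly better than $a$, which is exactly what makes the ceteris paribus condition fail for $X$.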



Fact 7.1.19 

We measure distances by counting. 

$X \prec_{l,c} \mathbf{C}X$ does not necessarily imply that $X$ is an improving Hamming neighbourhood of the best elements.



Proof 

Consider Example 7.1.8 (page 133). There $X \prec_{l,c} \mathbf{C}X$. $x'$ is the best element, and $y' \in [x', x]$, but $y' \notin X$. □
7.1.5 What is an obligation?

The reader will probably not expect a final definition. All we can do is to give a tentative definition, which, in all probability, 
will not be satisfactory in all cases. 

Definition 7.1.14 

We decide for the set relation and distance. 

(1) Hard obligation 

A hard obligation has the following properties: 

(1.1) It contains all ideal cases in the set considered. 

(1.2) It is closed under increasing quality, Definition 7.1.6 (page 127)

(1.3) It is an improving neighbourhood of the ideal cases (this also implies (1.1)), Definition 7.1.9 (page 130)

We are less committed to:

(1.4) It is ceteris paribus improving, Definition 7.1.8 (page 129)

An obligation O is a derived obligation of a system O of obligations iff it is a hard obligation based on the set variant of 
the order and distance generated by O. 

(2) Soft obligations 

A set is a soft obligation iff it satisfies the soft versions of the above postulates. The notion of size has to be given, and is transferred to products as described in Definition 7.1.1 (page 125). More precisely, strict universal quantifiers are transformed into their soft variant "almost all", and the other operators are left as they are. Of course, one might also want to use a mixture of soft and hard conditions, e.g. we might want to have all ideal cases, but give up closure for a small set of pairs $(x, x')$.

An obligation O is derived from O iff it is a soft obligation based on the set variant of the order and distance generated by 
the translation of O into their hard versions. (I.e. exceptions will be made explicit.) 

Fact 7.1.20 

Let $O \in \mathcal{O}$, then $\mathcal{O} \mid\sim O$ in the independent set case.
Proof

We check (1.1) - (1.3) of Definition 7.1.14.

(1.1) holds by independence.

(1.2) If $x \in O$, $x' \notin O$, then $x' \not\preceq_s x$.

(1.3) By Fact 7.1.12 (6).

Note that (1.4) will also hold by Fact 7.1.14 (2).
□ 



Corollary 7.1.21 

Every derived obligation is a classical consequence of the original set of obligations in the independent set case. 
Proof 

This follows from Fact 7.1.14 (page 136) (3) and Fact 7.1.20.
Example 7.1.14 

The Ross paradox is not a derived obligation. 
Proof 

Suppose we have the alphabet $p, q$ and the obligations $\{p, q\}$, let $R := p \vee \neg q$. This is not closed, as $\neg p \wedge q \prec \neg p \wedge \neg q \in R$. □
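The failure of closure for the Ross formula can be found by brute force. The sketch below is an illustration only; it assumes the set variant of the order (a world is strictly better iff it satisfies a strict superset of the obligations), and the names `worlds`, `satisfied`, `R` and `violations` are hypothetical.

```python
from itertools import product

OBLIGATIONS = ("p", "q")

# All four classical worlds over the alphabet p, q.
worlds = [dict(zip(OBLIGATIONS, bits)) for bits in product([True, False], repeat=2)]

def satisfied(w):
    """The obligations that hold in world w."""
    return {o for o in OBLIGATIONS if w[o]}

def strictly_better(a, b):
    """Set variant: a satisfies strictly more obligations than b (by inclusion)."""
    return satisfied(b) < satisfied(a)

def R(w):
    """The Ross formula p or not q."""
    return w["p"] or not w["q"]

# Pairs (a, b) with b in R, a strictly better than b, but a not in R:
# each such pair witnesses that R is not closed under increasing quality.
violations = [(a, b) for b in worlds if R(b)
              for a in worlds if strictly_better(a, b) and not R(a)]
print(violations)
```

Under these assumptions the only witness is the pair consisting of $\neg p \wedge q$ (outside $R$) and $\neg p \wedge \neg q$ (inside $R$), as in the proof.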



7.1.6 Conclusion 

Obligations differ from facts in the behaviour of negation, but not of conjunction and disjunction. The Ross paradox 
originates, in our opinion, from the differences in negation. Central to the treatment of obligations seems to be a relation 
of "better", which can generate obligations, but also be generated by obligations. The connection between obligations and 
7.2 A comment on work by Aqvist

7.2.1 Introduction 

The article [Aqv00] discusses three systems, which are presented now in outline. (When necessary, we will give details later in this section.)

(1) The systems Hm, where $m \in \omega$. The (Kripke style) semantics has for each $i \in \omega$ a subset $opt_i$ of the model set, s.t. the $opt_i$ form a disjoint cover of the model set, all $opt_i$ for $i \le m$ are non-empty, and all other $opt_i$ are empty. The $opt_i$ interpret new frame constants $Q_i$. The $opt_i$ describe intuitively levels of perfection, where $opt_1$ is the best, and $opt_m$ the worst level.

(2) The dyadic deontic logics Gm, $m \in \omega$. The semantics is again given by a cover $opt_i$ as above, and in addition, a function best, which assigns to each formula $\phi$ the "best" models of $\phi$, i.e. those which are in the best $opt_i$ set. The language has the $Q_i$ operators, and a new binary operator $O(\phi/\psi)$ (and its dual $P(./.)$), which expresses that in the best $\psi$-models $\phi$ holds. Note that there is no explicit "best" operator in the language.

(3) The dyadic deontic logic G. The semantics does not contain the $opt_i$ any more, but still the "best" operator as in case (2), which now corresponds to a ranking of the models (sufficient axioms are given). The language does not have the $Q_i$ any more, but contains the $O$ (and $P$) operator, which is interpreted in the natural way: $O(\phi/\psi)$ holds iff the best $\psi$-models are $\phi$-models. Note that again there is no explicit "best" operator in the language.

Thus, it corresponds to putting the $\mid\sim$-relation of ranked models in the object language.

In particular, there is no finiteness restriction any more, as in cases (1) and (2) - and here lies the main difference.
Aqvist gives a Theorem (Theorem 6) which shows (among other things) that
If $A$ is a G-sentence, provable in G, then it is also provable in Gm.

The converse is left open. Aqvist believed he had found a proof using another result of his, but, as he later discovered, it contained a gap.

We close this gap here.

7.2.2 There are (at least) two solutions 

(1) We take a detour via a language which contains an operator j3 to be interpreted by "best". 

(2) We work with the original language. 

As a matter of fact, Aqvist's paper contains already an almost complete solution of type (1), just the translation part 
is lacking. We give a slightly different proof (which will be self contained), which works with the original (i.e. possibly 
infinite) model set, but reduces the quantity of levels to a finite number. Thus, the basic idea is the same, our technique 
is more specific to the problem at hand, thus less versatile, but also less "brutal" . 

Yet, we can also work with the original language. Even if the operator $O(\phi/\psi)$ does not allow us to describe "best" as the operator $\beta$ can, we can approximate "best" sufficiently well to suit our purposes. (Note also that $O$ allows, using all formulas, to approximate "best" from above: $best(\phi) = \bigcap\{M(\psi) : O(\psi/\phi)\}$.)

We may describe the difference between solution (1) and (2) as follows: Solution (1) will preserve the exact "best" value 
of some - but not necessarily all, and this cannot be really improved - sets, solution (2) will not even allow this, but still, 
we stay sufficiently close to the original "best" value, so the formula at hand, and its subformulas, will preserve their truth 
values. 

In both cases, the basic idea will be the same: We have a G-model for $\phi$, and now construct a Gm-model (i.e. with finitely many levels) for $\phi$. $\phi$ is a finite entity, containing only finitely many propositional variables, say $p_1, \ldots, p_n$. Then we look at all sets of the type $\pm p_1 \wedge \ldots \wedge \pm p_n$, where $\pm$ is nothing or the negation symbol. This is basically how fine our structure will have to be. If we work with $\beta$, we have to get better, as $\beta(p_i)$ will, in general, be something new, so will $\beta(p_i \wedge \neg\beta(p_i))$, etc., thus we have to take the nesting of $\beta$'s in $\phi$ into account, too. (This will be done below.) If we work directly with the $O(\alpha/\beta)$ operator, then we need not go so far down as $O(\alpha/\beta)$ will always evaluate to (universal) true or false. If e.g. $\phi$ contains $O(\alpha/\beta)$ and $O(\alpha'/\beta)$, then we will try to define the "best" elements of $M(\beta)$ as $M(\beta \wedge \alpha \wedge \alpha')$, etc. We have to be careful to make a ranking, i.e. $O(./true)$ will give the lowest layer, and if $\psi \vdash \psi'$, $O(\phi/\psi)$, $O(\neg\phi/\psi')$, then the rank of $\psi'$ will be strictly smaller than the rank of $\psi$, etc. This is all straightforward, but a bit tedious.

As said, we will take the first approach, which seems a bit "cleaner" . 

We repeat now the definitions of [Aqv00] only as far as necessary to understand the subsequent pages. In particular, we
will not introduce the axiom systems, as we will work on the semantic side only. 

All systems are propositional. 

Definition 7.2.1 

The systems Hm, $m \in \omega$.
The language:

A set Prop of propositional variables, True, the usual Boolean operators, N (universal necessity), a set $\{Q_i : i \in \omega\}$ of systematic frame constants (zero place connectives like True) (and the duals False, M).

The semantics:

$\mathcal{M} = \langle W, V, \{opt_i : i \in \omega\}, m \rangle$, where $W$ is a set of possible worlds, $V$ a valuation as usual, each $opt_i$ is a subset of $W$.
Validity:
$\mathcal{M}, x \models p$ iff $x \in V(p)$ for $x \in W$, $p \in Prop$,
$\mathcal{M}, x \models True$,

$\mathcal{M}, x \models N\phi$ iff for all $y \in W$, $\mathcal{M}, y \models \phi$,
$\mathcal{M}, x \models Q_i$ iff $x \in opt_i$.
Conditions on the $opt_i$:

(a) $opt_i \cap opt_j = \emptyset$ if $i \neq j$,

(b) $opt_1 \cup \ldots \cup opt_m = W$,

(c) $opt_i \neq \emptyset$ for $i \le m$,

(d) $opt_i = \emptyset$ for $i > m$.

Definition 7.2.2 

The systems Gm, $m \in \omega$.
The language:

It is just like that for the systems Hm, with, in addition, a new binary connective $O(\phi/\psi)$, with the meaning that the "best" $\psi$-worlds satisfy $\phi$ (and its dual).

The semantics:

It is also just like the one for Hm, with, in addition, a function $B$ (for "best"), assigning to each formula $\phi$ a set of worlds, the best $\phi$-worlds, thus $\mathcal{M} = \langle W, V, \{opt_i : i \in \omega\}, B, m \rangle$.

Validity:

$\mathcal{M}, x \models O(\phi/\psi)$ iff $B(\psi) \subseteq M(\phi)$.
Conditions:

We have to connect $B$ to the $opt_i$, this is done by the condition $(\gamma 0)$ in the obvious way:

$(\gamma 0)$ $x \in B(\phi)$ iff $\mathcal{M}, x \models \phi$ and for each $y \in W$, if $\mathcal{M}, y \models \phi$, then $x$ is at least as good as $y$ - i.e. there is no strictly lower $opt_i$-level where $\phi$ holds.

Definition 7.2.3 

The system G. 
The language: 

It is just like that for the systems Gm, but without the Qi. 

The semantics: 

It is now just M = (W, V, B). 

Validity: 

As for Gm. 

Conditions: 

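We have no $opt_i$ now, which are replaced by suitable conditions on $B$, which make $B$ the choice function of a ranked structure:

$(\sigma 0)$ $M(\phi) = M(\phi') \rightarrow B(\phi) = B(\phi')$,

$(\sigma 2)$ $B(\phi) \cap M(\psi) \subseteq B(\phi \wedge \psi)$,
$(\sigma 3)$ $M(\phi) \neq \emptyset \rightarrow B(\phi) \neq \emptyset$,

$(\sigma 4)$ $B(\phi) \cap M(\psi) \neq \emptyset \rightarrow B(\phi \wedge \psi) \subseteq B(\phi) \cap M(\psi)$.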
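To make the interplay of levels, the function $B$ and the operator $O$ concrete, here is a small Python sketch. It is a toy model under stated assumptions, not Aqvist's construction: worlds are plain strings, formulas are represented directly by their model sets, `level[w]` is the index $i$ of the $opt_i$ containing $w$ (lower is better), and $B$ is defined from the levels as in $(\gamma 0)$. The world names and function names are hypothetical.

```python
from itertools import combinations

def best(level, worlds):
    """B(X) via (gamma 0): the worlds of X on the lowest opt-level occurring in X."""
    worlds = set(worlds)
    if not worlds:
        return set()
    low = min(level[w] for w in worlds)
    return {w for w in worlds if level[w] == low}

def obligatory(level, m_phi, m_psi):
    """O(phi/psi) holds iff B(psi) is a subset of M(phi)."""
    return best(level, m_psi) <= set(m_phi)

def sigma_conditions_hold(level):
    """Brute-force check of (sigma 2), (sigma 3), (sigma 4) over all model sets."""
    ws = list(level)
    subsets = [set(c) for k in range(len(ws) + 1) for c in combinations(ws, k)]
    for X in subsets:
        if X and not best(level, X):
            return False  # (sigma 3) fails
        for Y in subsets:
            if not best(level, X) & Y <= best(level, X & Y):
                return False  # (sigma 2) fails
            if best(level, X) & Y and not best(level, X & Y) <= best(level, X) & Y:
                return False  # (sigma 4) fails
    return True

# Toy structure with three levels: opt_1 is best, opt_3 is worst.
level = {"no_theft": 1, "theft_returned": 2, "theft_kept": 3}
everything = set(level)
theft = {"theft_returned", "theft_kept"}
returned = {"theft_returned"}

print(obligatory(level, returned, theft),          # best theft-worlds return the object
      obligatory(level, {"no_theft"}, everything),  # best worlds overall have no theft
      obligatory(level, returned, everything),      # returning is not obligatory outright
      sigma_conditions_hold(level))
```

In this representation $(\sigma 0)$ holds trivially, because formulas with the same model set are the same argument to `best`; the point of the sketch is only that a $B$ generated by finitely many levels is the choice function of a ranked structure.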
7.2.3 Outline 

We first show that the language (and logic) G may necessitate infinitely many levels (whereas the languages Gm ($m \in \omega$) admit only finitely many ones). Thus, when trying to construct a Gm-model for a G-formula $\phi$, we have to construct from a possibly "infinite" function $B$ via finitely many $opt_i$ levels a new function $B'$ with only finitely many levels, which is sufficient for the formula $\phi$ under consideration. Crucial for the argument is that $\phi$ is a finite formula, and we need only a limited degree of discernment for any finite formula. Most of the argument is standard reasoning about ranked structures, as it is common for sufficiently strong preferential structures.

We will first reformulate the problem slightly using directly an operator $\beta$, which will be interpreted by the semantic function $B$, and which results in a slightly richer language, as the operators $O$ and $P$ can be expressed using $\beta$, but not necessarily conversely. Thus, we show slightly more than what is sufficient to solve Aqvist's problem.

We make this official: 

Definition 7.2.4 

The language G' is like the language G, without $O(./.)$ and $P(./.)$, but with a new unary operator $\beta$, which is interpreted by the semantic function $B$.
(1) $O(\phi/\psi)$ can thus be expressed by $N(\beta(\psi) \rightarrow \phi)$ - N universal necessity. In particular, and this is the important aspect of $\beta$, $\beta(\phi)$ is now (usually) a non-trivial set of models, whereas $O$ and $P$ result always in $\emptyset$ or the set of all models.

(2) Note that by $(\sigma 3)$, if $M(\phi) \neq \emptyset$, then $M(\beta(\phi)) \neq \emptyset$, but it may very well be that $M(\beta(\phi)) = M(\phi)$ - all $\phi$-models may be equally good. ($M(\phi)$ is the set of $\phi$-models.)

□ 



We use now an infinite theory to force B to have infinitely many levels. 
Example 7.2.1 

We construct $\omega$ many levels for the models of the language $\{p_i : i \in \omega\}$, going downward:

Top level: $M(p_0)$

second level: $M(\neg p_0 \wedge p_1)$

third level: $M(\neg p_0 \wedge \neg p_1 \wedge p_2)$, etc.

We can express via $\beta$, and even via $O(./.)$, that these levels are all distinct, e.g. by $\beta(p_0 \vee (\neg p_0 \wedge p_1)) \models \neg p_0$, or by $O(\neg p_0 / p_0 \vee (\neg p_0 \wedge p_1))$, etc. So we will necessarily have $\omega$ many non-empty $opt_i$-levels, for any $B$ which satisfies the condition connecting the $opt_i$ and $B$, condition $(\gamma 0)$.

□ 



We work now in the fixed G-model $\Gamma$ (with the $\beta$ operator). Let in the sequel all $X$, $Y$, $X_i$ be non-empty (to avoid trivial cases) model sets. By prerequisite, $B$ satisfies $(\sigma 0)$ - $(\sigma 4)$.

Definition 7.2.5 

(1) $X \prec Y$ iff $B(X \cup Y) \cap Y = \emptyset$,

(2) $X \sim Y$ iff $B(X \cup Y) \cap X \neq \emptyset$ and $B(X \cup Y) \cap Y \neq \emptyset$, i.e. iff $X \not\prec Y$ and $Y \not\prec X$.

Remark 7.2.2 

(1) $\prec$ and $\sim$ behave nicely:

(1.1) $\prec$ and $\sim$ are transitive, $\sim$ is reflexive and symmetric, and for no $X$ $X \prec X$.

(1.2) $\prec$ and $\sim$ cooperate: $X \prec Y \sim Z \rightarrow X \prec Z$ and $X \succ Y \sim Z \rightarrow X \succ Z$
Thus, $\prec$ and $\sim$ give a ranking.

(2.1) $B(X \cup Y) = B(X)$ if $X \prec Y$

(2.2) $B(X \cup Y) = B(X) \cup B(Y)$ if $X \sim Y$

(3) $B(X_1 \cup \ldots \cup X_n) = \bigcup\{B(X_i) : \neg\exists X_j (X_j \prec X_i), 1 \le i, j \le n\}$ - i.e. $B(X_1 \cup \ldots \cup X_n)$ is the union of the $B(X_i)$ with minimal $\prec$-rank.

□ 



We come to the main idea and its formalization. 

Let a fixed $\phi$ be given. Let $\phi$ contain the propositional variables $p_1, \ldots, p_m$, and the operator $\beta$ to a depth of nesting $n$.

Definition 7.2.6 

(1) Define by induction: 

Elementary sets of degree 0: all intersections of type $M(\pm p_1) \cap M(\pm p_2) \cap \ldots \cap M(\pm p_m)$, where $\pm p_j$ is either $p_j$ or $\neg p_j$.
Elementary sets of degree $i + 1$: If $s$ is an elementary set of degree $i$, then $B(s)$ and $s - B(s)$ are elementary sets of degree $i + 1$.

(2) Unions of degree $i$ are either $\emptyset$ or (arbitrary) unions of elementary sets of degree $i$.

This definition goes up to $i = n$, though we could continue for all $i \in \omega$, but this is not necessary.

(3) $\mathcal{E}$ is the set of all elementary sets of degree $n$.

Remark 7.2.3 

(1) For any degree $i$, the elementary sets of degree $i$ form a disjoint cover of $M_{\mathcal{L}}$ - the set of all classical models of the language.

(2) The elementary sets of degree $i + 1$ form a refinement of the elementary sets of degree $i$.
(4) If $X$ is a union of degree $i$, $B(X)$ and $X - B(X)$ are unions of degree $i + 1$.

(5) A union of degree i is also a union of degree j if j > i. 
□ 

We construct now the new Gm-structure. We first define the $opt_i$ levels, and from these levels, the new function $B'$. Thus, $(\gamma 0)$ will hold automatically. As we know from the above example, we may necessarily lose some discerning power, so a word on where it is lost and where it goes may be adequate. The construction will not look inside $B(s)$ and $s - B(s)$ for $s \in \mathcal{E}$. For $B(s)$, this is not necessary, as we know that all elements inside $B(s)$ are on the same level ($B(s) = B(B(s))$). Inside $s - B(s)$, the original function $B$ may well be able to discern still (even infinitely many) different levels, but our formula $\phi$ does not permit us to look down at these details - $s - B(s)$ is treated as an atom, one chunk without any details inside.

Definition 7.2.7 

We define the rank of $X \in \mathcal{E}$, and the $opt_i$ and $B'$. This is the central definition. Let $X \neq \emptyset$, $X \in \mathcal{E}$.

(1) Set $rank(X) = 0$, iff there is no $Y \prec X$, $Y \in \mathcal{E}$.

Set $rank(X) = i + 1$, iff $X \in \mathcal{E} - \{Z : rank(Z) \le i\}$ and there is no $Y \prec X$, $Y \in \mathcal{E} - \{Z : rank(Z) \le i\}$.
So, $rank(X)$ is the $\prec$-level of $X$.

(2) Set $opt_i := \bigcup\{X \in \mathcal{E} : rank(X) = i\}$, and $opt_i = \emptyset$ iff there is no $X$ s.t. $rank(X) = i$.

(3) $B'(A) := A \cap opt_i$, where $i$ is the smallest $j$ s.t. $A \cap opt_j \neq \emptyset$, for $A \neq \emptyset$.

Remark 7.2.4 

(1) The $opt_i$ form again a disjoint cover of $M_{\mathcal{L}}$, all $opt_i$ are $\neq \emptyset$ up to some $k$, and $\emptyset$ beyond.

(2) $(\gamma 0)$ will now hold by definition.

(3) B' is not necessarily B, but sufficiently close. 

Lemma 7.2.5 

(Main result) 

For any union $X$ of degree $k < n$, $B(X) = B'(X)$.
Proof

Write $X$ as a union of degree $n - 1$, and let $\mathcal{X} := \{X' \subseteq X : X'$ is of degree $n - 1\}$. Note that the construction of $\prec$/$\sim$/opt splits all $X' \in \mathcal{X}$ into $B(X')$ and $X' - B(X')$, both of degree $n$, and that both parts are always present, there will not be any isolated $X' - B(X')$ without its counterpart $B(X')$.

Let $X' \in \mathcal{X}$. Then $(X' - B(X')) \cap B'(X) = \emptyset$, as the opt-level of $B(X')$ is better than the opt-level of $X' - B(X')$. Obviously, also $(X' - B(X')) \cap B(X) = \emptyset$. Thus, $B(X)$ and $B'(X)$ are the union of certain $B(X')$, $X' \in \mathcal{X}$. Suppose $B(X') \subseteq B(X)$ for some $X' \in \mathcal{X}$. Then for no $X'' \in \mathcal{X}$ $B(B(X') \cup B(X'')) \cap B(X') = \emptyset$, so $B(X')$ has minimal opt-level in $X$, and $B(X') \subseteq B'(X)$. Conversely, let $B(X') \not\subseteq B(X)$. Then there is $X'' \in \mathcal{X}$ s.t. $B(B(X') \cup B(X'')) \cap B(X') = \emptyset$, so $B(X')$ has not minimal opt-level in $X$, and $B(X') \cap B'(X) = \emptyset$. □
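The construction of Definitions 7.2.6 and 7.2.7 and the statement of the Lemma can be tried out on a finite toy structure. The following Python sketch is an illustration under assumptions: worlds are valuations of four variables, the formula is assumed to mention only the first $m = 1$ of them with nesting depth $n = 2$, and the original $B$ is generated by a hypothetical level function (number of false variables). Empty elementary sets are simply dropped, and all names are made up for the sketch.

```python
from itertools import combinations, product

def B(level, X):
    """Original 'best' function: the worlds of X on the lowest level occurring in X."""
    if not X:
        return frozenset()
    low = min(level[w] for w in X)
    return frozenset(w for w in X if level[w] == low)

def elementary_sets(worlds, level, m, n):
    """Non-empty elementary sets of degree n; degree 0 groups worlds by the first m variables."""
    cells = {}
    for w in worlds:
        cells.setdefault(w[:m], set()).add(w)
    sets = [frozenset(c) for c in cells.values()]
    for _ in range(n):
        refined = []
        for s in sets:
            b = B(level, s)
            refined += [part for part in (b, s - b) if part]
        sets = refined
    return sets

def ranks(level, sets):
    """rank(X): repeatedly peel off the sets with no remaining Y such that Y < X."""
    def prec(X, Y):  # X < Y iff B(X u Y) does not meet Y
        return not (B(level, X | Y) & Y)
    rank, remaining, i = {}, list(sets), 0
    while remaining:
        for X in [X for X in remaining if not any(prec(Y, X) for Y in remaining)]:
            rank[X] = i
        remaining = [X for X in remaining if X not in rank]
        i += 1
    return rank

def B_prime(rank, A):
    """B'(A) = A intersected with opt_i, for the smallest i where this is non-empty."""
    hits = [(r, X & A) for X, r in rank.items() if X & A]
    if not hits:
        return frozenset()
    low = min(r for r, _ in hits)
    return frozenset().union(*(part for r, part in hits if r == low))

def unions_of(sets):
    """All non-empty unions of the given sets."""
    return [frozenset().union(*combo)
            for k in range(1, len(sets) + 1) for combo in combinations(sets, k)]

worlds = list(product([True, False], repeat=4))
level = {w: w.count(False) for w in worlds}  # hypothetical original levels
m, n = 1, 2
rank = ranks(level, elementary_sets(worlds, level, m, n))

lemma_holds = all(B(level, X) == B_prime(rank, X)
                  for X in unions_of(elementary_sets(worlds, level, m, n - 1)))
coarse = frozenset(w for w in worlds if not w[0] and level[w] >= 3)
print(lemma_holds, B(level, coarse) == B_prime(rank, coarse))
```

The set `coarse` is a chunk of the form $s - B(s)$ of degree $n$: the original $B$ still separates two levels inside it, while $B'$ treats it as an atom, so the two functions differ there although they agree on all unions of degree $n - 1$.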



Corollary 7.2.6 

If $\phi'$ is built up from the ingredients of $\phi$, then $[\phi'] = [\phi']'$ - where $[\phi']$ ($[\phi']'$) is the set of models where $\phi'$ holds in the original (new) structure.

Proof

Let $\Phi$ be the set of formulas which are built up from the ingredients of $\phi$, i.e. using (some of) the propositional variables of $\phi$, and up to the nesting depth of $\beta$ of $\phi$.

Case 0: Let $p \in \Phi$ be a propositional variable. Then $[p] = [p]'$, as we did not change the classical model. Moreover, $[p]$ is a union of degree 0.

Case 1: Let $\phi', \phi'' \in \Phi$, and let $[\phi'] = [\phi']'$, $[\phi''] = [\phi'']'$ be unions of degree $k'$ and $k''$ respectively, and let $k := max(k', k'') \le n$. Then both are unions of degree $k$, and so are $[\phi' \wedge \phi'']$, $[\neg\phi']$ etc., and $[\phi' \wedge \phi''] = [\phi' \wedge \phi'']'$, as $\wedge$ is interpreted by intersection, etc. Let now $k < n$. We have to show that $[\beta(\phi')] = [\beta(\phi')]'$. But $[\beta(\phi')] = B([\phi']) = B'([\phi']) = B'([\phi']') = [\beta(\phi')]'$ by the above Lemma and induction hypothesis.

□ 



Example 7.2.2 
$p \wedge \neg\beta(p)$, $\neg p \wedge \neg\beta(\neg p)$ on the worst non-empty level

$\beta(\neg p)$ on the best level.

When we calculate now $B'(M(p))$ via opt, we decompose $p$ in its components $\beta(p)$ and $p - \beta(p)$, and see that $\beta(p)$ is on a better level than $p - \beta(p)$, so $B'(M(p)) = B(M(p))$, as it should be. □



We finally have: 



7.2.4 Gm $\vdash A$ implies G $\vdash A$ (Outline)

We show that $\forall m$(Gm $\vdash A$) implies G $\vdash A$ ($A$ a G-formula).

As L. Aqvist has given equivalent semantics for both systems, we can argue semantically. We turn the problem round: If G $\not\vdash A$, then there is $m$ s.t. Gm $\not\vdash A$. Or, in other words, if there is a G-model and a point in it, where $A$ does not hold, then we find an analogue Gm-model, or, still differently, if $\phi$ is any G-formula, and there is a G-model $\Gamma$ and a point $x$ in $\Gamma$, where $\phi$ holds, then we find $m$ and a Gm-model $\Delta$, and a point $y$ in $\Delta$ where $\phi$ holds.

By prerequisite, $\phi$ contains some propositional variables, perhaps the (absolute) quantifiers M and N, usual connectives, and the binary operators $O$ and $P$. Note that the function "best" intervenes only in the interpretation of $O$ and $P$. Moreover, the axioms $\sigma_i$ express that best defines a ranking, e.g. in the sense of ranked models in preferential reasoning. In addition, $(\sigma 3)$ is a limit condition (which essentially excludes unbounded descending chains).

Let $\phi$ contain $n$ symbols. We thus use "best" for at most $n$ different $\psi$, where $O(\phi'/\psi)$ (or $P(\phi'/\psi)$) is a subformula of $\phi$.

We introduce now more structure into the G-model. We make $m := n + 1$ layers in G's universe, where the first $n$ layers are those mentioned in $\phi$. More precisely, we put the best $\psi$-models for each $\psi$ mentioned as above in its layer - respecting relations between layers when needed (this is possible, as the $\sigma_i$ are sufficiently strong), and put all other $\psi$-models somewhere above. The fact that we have one supplementary layer (which, of course, we put on top) guarantees that we can do so. The $opt_i$ will be the layers.

We then have a Gm-model (if we take a little care, so nothing gets empty prematurely), and $\phi$ will hold in our new structure.

□ 



7.3 Hierarchical conditionals 

7.3.1 Introduction 

7.3.1.1 Description of the problem 

We often see a hierarchy of situations, e.g.: 

(1) it is better to prevent an accident than to help the victims, 

(2) it is better to prove a difficult theorem than to prove an easy lemma, 

(3) it is best not to steal, but if we have stolen, we should return the stolen object to its legal owner, etc. 
On the other hand, it is sometimes impossible to achieve the best objective. 

We might have seen the accident happen from far away, so we were unable to interfere in time to prevent it, but we can 
still run to the scene and help the victims. 

We might have seen friends last night and had a drink too many, so today's headaches will not allow us to do serious work, 
but we can still prove a little lemma. 

We might have needed a hammer to smash the windows of a car involved in an accident, so we stole it from a building 
site, but will return it afterwards. 

We see in all cases: 

- a hierarchy of situations 

- not all situations are possible or accessible for an agent. 
In addition, we often have implicitly a "normality" relation: 

Normally, we should help the victims, but there might be situations where not: This would expose ourselves to a very big 
danger, or this would involve neglecting another, even more important task (we are supervisor in a nuclear power plant 
....), etc. 

Thus, in all "normal" situations where an accident seems imminent, we should try to prevent it. If this is impossible, in 
all "normal" situations, we should help the victims, etc.
(1) normality,

(2) hierarchy, 

(3) accessibility 

in the present paper. 

Note that it might be well possible to give each situation a numerical value and decide by this value what is right to do - 
but humans do not seem to think this way, and we want to formalize human common sense reasoning. 

Before we begin the formal part, we elaborate above situations with more examples. 

• We might have the overall intention to advance computer science. 

So we apply for the job of head of department of computer science at Stanford, and promise every tenured scientist 
his own laptop. 

Unfortunately, we do not get the job, but become head of computer science department at the local community 
college. The college does not have research as priority, but we can still do our best to achieve our overall intention, 
by, say, buying good books for the library, or buying computers for those still active in research, etc.

So, it is reasonable to say that, even if we failed in the best possible situation - it was not accessible to us - we still 
succeeded in another situation, so we achieved the overall goal. 

• The converse is also possible, where better solutions become possible, as is illustrated by the following example. 

The daughter and her husband say they have the overall intention to start a family life with a house of their own, and
children. 

Suppose the mother now asks her daughter: You have been married now for two years, how come you are not
pregnant? 

Daughter - we cannot afford a baby now, we had to take a huge mortgage to buy our house and we both have to 
work. 

Mother - I shall pay off your mortgage. Get on with it! 

In this case, what was formerly inaccessible, is now accessible, and if the daughter was serious about her intentions - 
the mother can begin to look for baby carriages. 

Note that we do not distinguish here how the situations change, whether by our own doing, or by someone else's 
doing, or by some events not controlled by anyone. 

• Consider the following hierarchy of obligations making fences as unobtrusive as possible, involving contrary to duty 
obligations. 

(1) You should have no fence (main duty). 

(2) If this is impossible (e.g. you have a dog which might invade neighbours' property), it should be less than 3 feet 
high (contrary to duty, but second best choice). 

(3) If this is impossible too (e.g. your dog might jump over it), it should be white (even more contrary to duty, but 
still better than nothing). 

(4) If all is impossible, you should get the neighbours' consent (etc.). 
7.3.1.2 Outline of the solution 

The last example can be modelled as follows ($\mu(x)$ is the set of minimal models of $x$):

Layer 1: $\mu(True)$: all best models have no fence.

Layer 2: $\mu$(fence): all best models with a fence are less than 3 ft. high.

Layer 3: $\mu$(fence and more than 3 ft. high): all best models with a tall fence have a white fence.

Layer 4: $\mu$(fence and non-white and > 3 ft): in all best models with a non-white fence taller than 3 feet, you have permission.

Layer 5: all the rest

This will be modelled by a corresponding $\mathcal{A}$-structure.
In summary: 

(1) We have a hierarchy of situations, where one group (e.g. preventing accidents) is strictly better than another group 
(e.g. helping victims). 

(2) Within each group, preferences are not so clear (first help person A, or person B, first call ambulance, etc.?). 

(3) We have a subset of situations which are attainable, this can be modelled by an accessibility relation which tells us 
which situations are possible or can be reached. 
Diagram 7.3.1: $\mathcal{A}$-ranked structure. (Each layer behaves inside like any preferential structure. Amongst each other, layers behave like ranked structures.)



We combine all three ideas, consider what we call $\mathcal{A}$-ranked structures, structures which are organized in levels $A_1$, $A_2$, $A_3$, etc., where all elements of $A_1$ are better than any element of $A_2$ - this is basically rankedness -, and where inside each $A_i$ we have an arbitrary relation of preference. Thus, an $\mathcal{A}$-ranked structure is between a simple preferential structure and a fully ranked structure.

See Diagram 7.3.1 (page 145).

Remark: It is not at all necessary that the rankedness relation between the different layers and the relation inside the layers 
express the same concept. For instance, rankedness may express deontic preference, whereas the inside relation expresses 
normality or some usualness. 

In addition, we have an accessibility relation R, which tells us which situations are reachable. 

It is perhaps easiest to motivate the precise choice of modelling by layered (or contrary to duty) obligations. 

For any point $t$, let $R(t) := \{s : tRs\}$, the set of $R$-reachable points from $t$. Given a preferential structure $\mathcal{X} := \langle X, \prec \rangle$, we can relativize $\mathcal{X}$ by considering only those points in $X$, which are reachable from $t$.

Let $X' \subseteq X$, and $\mu(X')$ the minimal points of $X'$, we will now consider $\mu(X') \cap R(t)$ - attention, not: $\mu(X' \cap R(t))$! This choice is motivated by the following: norms are universal, and do not depend on one's situation $t$.

If $\mathcal{X}$ describes a simple obligation, then we are obliged to $Y$ iff $\mu(X') \cap R(t) \neq \emptyset$, and $\mu(X') \cap R(t) \subseteq Y$. The first clause excludes obligations to the unattainable. We can write this as follows, supposing that $X'$ is the set of models of $\phi'$, and $Y$ is the set of models of $\psi$:

$m \models \phi' > \psi$.

Thus, we put the usual consequence relation $\mid\sim$ into the object language as $>$, and relativize to the attainable (from $m$).

If an $\mathcal{A}$-ranked structure has two or more layers, then we are, if possible, obliged to fulfill the lower obligation, e.g. prevent
an accident, but if this is impossible, we are obliged to fulfill the upper obligation, e.g. help the victims, etc.

See Diagram 7.3.2.
Diagram 7.3.2: $\mathcal{A}$-ranked structure and accessibility. The overall structure is visible from $t$; only the inside of the circle is visible from $s$, where $t R s$; half-circles are the sets of minimal elements of layers.



Let now, for simplicity, $B$ be a subset of the union of all layers $A$, and let $B$ be the set of models of $\beta$. This can be done,
as the individual subset can be found by considering $A \cap B$, and call the whole structure $(\mathcal{A}, B)$.

Then we say that $m$ satisfies $(\mathcal{A}, B)$ iff in the lowest layer $A$ where $\mu(A) \cap R(m) \neq \emptyset$, we have $\mu(A) \cap R(m) \subseteq B$.

When we want a terminology closer to usual conditionals, we may write e.g. $(A_1 > B_1; A_2 > B_2; \ldots)$ expressing that
the best is $A_1$, and then $B_1$ should hold, the second best is $A_2$, then $B_2$ should hold, etc. (The $B_i$ are just $A_i \cap B$.) See
Diagram 7.3.3 (page 150).
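
The satisfaction condition can be made concrete with a small sketch. The Python below is only an illustration under stated assumptions: the names `satisfies`, `layers_min`, `reachable` and `good` are hypothetical, `layers_min[i]` stands for $\mu(A_i)$ listed from best to worst, `reachable` for $R(m)$, and `good` for $B$. The accident scenario is a toy encoding of the running example, not data from the text.

```python
def satisfies(layers_min, reachable, good):
    """Check the hierarchical conditional at one point m.

    layers_min: list of sets; layers_min[i] plays the role of mu(A_i),
                ordered from the best layer to the worst.
    reachable:  set playing the role of R(m).
    good:       set playing the role of B.
    """
    for mins in layers_min:
        visible = mins & reachable
        if visible:  # lowest layer with mu(A_i) ∩ R(m) non-empty
            return visible <= good
    # Nothing is reachable: the condition holds vacuously
    # (a convention adopted for this sketch).
    return True


# Toy scenario (assumed): layer 0 = accident prevented,
# layer 1 = accident happened, victims helped or ignored.
LAYERS = [{"prevented"}, {"helped", "ignored"}]
GOOD = {"prevented", "helped"}
```

With everything reachable, only the best layer is consulted; once "prevented" is out of reach, the second layer decides, and the reachable "ignored" case makes the conditional fail unless it is also unreachable.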

7.3.2 Formal modelling and summary of results 

We started with an investigation of "best fulfillment" of abstract requirements, and contrary to duty obligations. See
also [Gab08] and [Gab08a].

It soon became evident that semi-ranked preferential structures give a natural semantics to contrary to duty obligations,
just as simple preferential structures give a natural semantics to simple obligations - the latter goes back to Hansson
[Han69].

A semi-ranked preferential structure (or $\mathcal{A}$-ranked preferential structure, as we will call them later, as they are based on a system of sets $\mathcal{A}$)
has a finite number of layers, which amongst them are totally ordered by a ranking, but the internal ordering is just any
(binary) relation. It thus has stronger properties than a simple preferential structure, but not as strong ones as a (totally)
ranked structure.

The idea is to put the (cases of the) strongest obligation at the bottom, and the weaker ones more towards the top. Then, 
fulfillment of a strong obligation makes the whole obligation automatically satisfied, and the weaker ones are forgotten. 

Beyond giving a natural semantics to contrary to duty obligations, semi-ranked structures seem very useful for other 
questions of knowledge representation. For instance, any blackbird might seem a more normal bird than any penguin, but 
we might not be so sure within each set of birds. 

Thus, this generalization of preferential semantics seems very natural and welcome. 

The second point of this paper is to make some, but not necessarily all, situations accessible to each point of departure. 

... denoting situations which can be reached. If this relation is transitive, then we have restrictions on the set of reachable
situations: if $p$ is accessible from $p'$, and $p$ can access situation $s$, then so can $p'$, but not necessarily the other way round.

On the formal side, we characterize: 

(1) $\mathcal{A}$-ranked structures,

(2) satisfaction of an $\mathcal{A}$-ranked conditional once an accessibility relation between the points $p$, $p'$, etc. is given.

For the convenience of the reader, we now state the main formal results of this paper - together with the more unusual
definitions.

On (1): 

Let $A$ be a fixed set, and $\mathcal{A}$ a finite, totally ordered (by $<$) disjoint cover by non-empty subsets of $A$.

For $x \in A$, let $rg(x)$ be the unique $A' \in \mathcal{A}$ such that $x \in A'$, so $rg(x) < rg(y)$ is defined in the natural way.

A preferential structure $\langle \mathcal{X}, \prec \rangle$ ($\mathcal{X}$ a set of pairs $\langle x, i \rangle$) is called $\mathcal{A}$-ranked iff for all $x, x'$, $rg(x) < rg(x')$ implies $\langle x, i \rangle \prec
\langle x', i' \rangle$ for all $\langle x, i \rangle, \langle x', i' \rangle \in \mathcal{X}$. See Definition 4.1.1 for the definition of preferential structures, and Diagram
7.3.1 (page 145) for an illustration.

We then have: 

Let $\mid\sim$ be a logic for $\mathcal{L}$. Set $T^{\mathcal{M}} := Th(\mu_{\mathcal{M}}(M(T)))$, and $\overline{\overline{T}} := \{\phi : T \mid\sim \phi\}$, where $\mathcal{M}$ is a preferential structure.

(1) Then there is a (transitive) definability preserving classical preferential model $\mathcal{M}$ s.t. $\overline{\overline{T}} = T^{\mathcal{M}}$ iff
(LLE), (CCL), (SC), (PR) hold for all $T, T' \subseteq \mathcal{L}$.

(2) The structure can be chosen smooth iff, in addition,
(CUM) holds.

(3) The structure can be chosen $\mathcal{A}$-ranked iff, in addition,

($\mathcal{A}$-min) $T \nvdash \neg\alpha_i$ and $T \nvdash \neg\alpha_j$, $i < j$ implies $\overline{\overline{T}} \vdash \neg\alpha_j$
holds.

See Definition 4.1.2 (page 55) for the logic defined by a preferential structure, Definition 2.3 (page 31) for the logical
conditions, Definition 4.1.3 (page 55) for smoothness.

On (2) 

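Given a transitive accessibility relation $R$, $R(m) := \{x : mRx\}$.

Given $\mathcal{A}$ as above, let $B \subseteq A$ be the set of "good" points in $A$, and set $\mathcal{C} := (\mathcal{A}, B)$.
We define:

(1) $\mu(\mathcal{A}) := \bigcup\{\mu(A_i) : i \in I\}$
(warning: this is NOT $\mu(A)$)

(2) $\mathcal{A}_m := R(m) \cap A$,

(3) $\mu(\mathcal{A}_m) := \bigcup\{\mu(A_i) \cap R(m) : i \in I\}$
(3a) $\nu(\mathcal{A}_m) := \mu(\mu(\mathcal{A}_m))$

(thus $\nu(\mathcal{A}_m) = \{a \in A : \exists A \in \mathcal{A}\,(a \in \mu(A),\ a \in R(m),$ and
$\neg\exists a'\,(\exists A' \in \mathcal{A}\,(a' \in \mu(A'),\ a' \in R(m),\ a' \prec a)))\}$).

(4) $m \models \mathcal{C} :\leftrightarrow \nu(\mathcal{A}_m) \subseteq B$.
See Diagram 7.3.3 (page 150).
Then the following hold: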
Then the following hold: 

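Let $m, m' \in M$, $A, A' \in \mathcal{A}$, $A$ be the set of models of $\alpha$.

(1) $m \models \Box\neg\alpha$, $mRm'$ $\Rightarrow$ $m' \models \Box\neg\alpha$

(2) $mRm'$, $\nu(\mathcal{A}_m) \cap A \neq \emptyset$, $\nu(\mathcal{A}_{m'}) \cap A' \neq \emptyset$ $\Rightarrow$ $A \leq A'$ (in the ranking)

(3) $mRm'$, $\nu(\mathcal{A}_m) \cap A \neq \emptyset$, $\nu(\mathcal{A}_{m'}) \cap A' \neq \emptyset$, $m \models \mathcal{C}$, $m' \not\models \mathcal{C}$ $\Rightarrow$ $A < A'$

Conversely, these conditions suffice to construct an accessibility relation between $M$ and $A$ satisfying them, so they are
sound and complete.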




7.3.3 Overview 

We next point out some connections with other domains of artificial intelligence and computer science. 

We then put our work in perspective with a summary of logical and semantical conditions for nonmonotonic and related
logics, and present basic definitions for preferential structures.

Next, we will give special definitions for our framework.

We then start the main formal part, and prove representation results for $\mathcal{A}$-ranked structures, first for the general case,
then for the smooth case. The general case needs more work, as we have to do a (minor) modification of the not $\mathcal{A}$-ranked
case. The smooth case is easy, we simply have to append a small construction. Both proofs are given in full detail, in order
to make the text self-contained.

Finally, we characterize changes due to restricted accessibility. 
Definition 7.3.1 

We have the usual framework of preferential structures, i.e. either a set with a possibly non-injective labelling function, 
or, equivalently, a set of possible worlds with copies. The relation of the preferential structure will be fixed, and will not 
depend on the point m from where we look at it. 

Next, we have a set $A$, and a finite, disjoint cover $A_i : i < n$ of $A$, with a relation "of quality" $<$. $\mathcal{A}$ will denote the $A_i$
(and thus $A$) and $<$, i.e. $\mathcal{A} = \langle \{A_i : i \in I\}, < \rangle$.

By Fact 5.2.11, we may assume that all $A_i$ are described by a formula.

Finally, we have $B \subseteq A$, the subset of "good" elements of $A$ - which we also assume to be described by a formula.

In addition, we have a binary relation of accessibility, R, which we assume transitive - modal operators will be defined 
relative to R. R determines which part of the preferential structure is visible. 

Let R(s) := {t : sRt}. 
Definition 7.3.2 

We repeat here from the introduction, and assume $A_i = M(\alpha_i)$, $B = M(\beta)$, and $\mu$ expresses the minimality of the
preferential structure.

$t \models \alpha_i > \beta :\leftrightarrow \mu(A_i) \cap R(t) \subseteq B$,

we will also abuse notation and just write

$t \models A_i > B$ in this case.

We then define:

$t \models \mathcal{C}$ iff at the smallest $i$ s.t. $\mu(A_i) \cap R(t) \neq \emptyset$, $\mu(A_i) \cap R(t) \subseteq B$ holds.
This motivates Definition 5.2.1.

Note that automatically for $X \subseteq A$, $\mu(X) \subseteq A_j$ when $j$ is the smallest $i$ s.t. $X \cap A_i \neq \emptyset$.

The idea is now to make the $A_i$ the layers, and "trigger" the first layer $A_j$ s.t. $\mu(A_j) \cap R(x) \neq \emptyset$, and check whether
$\mu(A_j) \cap R(x) \subseteq B_j$. A suitable ranked structure will automatically find this $A_j$.

More definitions and results for such $\mathcal{A}$ and $\mathcal{C}$ will be found in Section 7.3.5 (page 149).

7.3.4 Connections with other concepts 
7.3.4.1 Hierarchical conditionals and programs 

Our situation is now very similar to a sequence of computer program instructions:

if $A_1$ then do $B_1$;

else if $A_2$ then do $B_2$;

else if $A_3$ then do $B_3$;

where we can see the $B_i$ as subroutines.
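
The analogy can be written out as a short dispatch loop. This is a minimal sketch with hypothetical names (`run_hierarchy`, `rules`, and the toy state keys are not from the text): each pair holds a test for a case $A_i$ and the subroutine $B_i$ to run, listed from the best case downwards.

```python
def run_hierarchy(rules, state):
    """Run the subroutine of the first rule whose condition holds.

    rules: list of (condition, subroutine) pairs, best case first,
           mirroring "if A_1 then do B_1; else if A_2 then do B_2; ...".
    """
    for condition, subroutine in rules:
        if condition(state):
            return subroutine(state)
    return None  # no case applies


# Toy rules (assumed for illustration): prevent the accident if that is
# possible, otherwise help the victims.
rules = [
    (lambda s: s["can_prevent"], lambda s: "prevent accident"),
    (lambda s: s["can_help"], lambda s: "help victims"),
]
```

Only the first applicable case fires, so later (weaker) cases are consulted only when the better ones are unavailable.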

We can deepen this analogy in two directions: 

(1) connect it to Update 

(2) put an imperative touch to it. 

In both cases, we differentiate between different degrees of fulfillment of $\mathcal{C}$: the lower the level is which is fulfilled, the
better.

(1) We can consider all threads of reachability which lead to a model $m$ where $m \models \mathcal{C}$. Then we take as best threads those
which lead to the best fulfillment of $\mathcal{C}$. So degree of fulfillment gives the order by which we should do the update. (This
is then not update in the sense that we choose the most normal developments, but rather we actively decide for the most
desirable ones.) We will not pursue this line any further here, but leave it for future research.

(2) We introduce an imperative operator, say $!$; $!$ means that one should fulfill $\mathcal{C}$ as best as possible by suitable choices.
We will elaborate this now.

First, we can easily compare the degree of satisfaction of $\mathcal{C}$ of two models:
Definition 7.3.3

For two sets of models, $X$, $X'$, the situation does not seem so easy. So suppose that $X, X' \models \mathcal{C}$. First, we have to decide
how to compare them; we do so by the maximum: $X < X'$ iff the worst satisfaction of all $x \in X$ is better than the worst
satisfaction in $X'$. More precisely, we look at the degree of satisfaction of $\mathcal{C}$ for all $x \in X$, take the maximum (which exists, as $\mathcal{A}$ is finite), and
then compare the maxima for $X$ and for $X'$.

Suppose now that there are points where we can make decisions ("free will"), let $m$ be such a point. We introduce a
new relation $D$, and let $mDm'$ iff we can decide to go from $m$ to $m'$. The relation $D$ expresses this possibility - it is our
definition of "free will".

Definition 7.3.4 

Consider now some formula $\phi$, and define
$m \models\, !\phi :\Leftrightarrow D(m) \cap M(\phi) < D(m) \cap M(\neg\phi)$
(as defined in Definition 7.3.3 (page 148)).



7.3.4.2 Connection with Theory Revision 

In particular, the situation of contrary to duty obligations (see Section 7.3.1 (page 143)) shows an intuitive similarity to
revision. You have the duty not to have a fence. If this is impossible (read: inconsistent), then it should be white. So the
duty is revised.

But there is also a formal analogy: As is well known, AGM revision (with fixed left hand side $K$) corresponds to a ranked
order of models, where models of $K$ have lowest rank (or: distance from $K$-models). The structures we consider
($\mathcal{A}$-rankings) are partially ranked, i.e. there is only a partial ranked preference; inside the layers, nothing is said about
the ordering. This partial ranking is natural, as we have only a limited number of cases to consider.

But we use the revision order (based on $K$, so it really is a $<_K$ relation) differently: We do not revise $K$, but use only the
order to choose the first layer which has non-empty intersection with the set of possible cases. Still, the spirit (and formal
apparatus) of revision is there, just used somewhat differently. The $K$-relation expresses here deontic quality, and if the
best situation is impossible, we choose the second best, etc.

Theory revision with variable $K$ is expressed by a distance between models (see [LMS01]), where $K * \phi$ is defined by the
set of $\phi$-models which have minimal distance from the set of $K$-models.

We can now generalize our idea of layered structure to a partial distance as follows: For instance, $d(K, A)$ is defined,
$d(K, B)$ too, and we know that all $A$-models with minimal distance to $K$ have smaller distance than the $B$-models with
minimal distance to $K$. But we do NOT know a precise distance for other $A$-models; we can sometimes compare, but not
always. We may also know that all $A$-models are closer to $K$ than any $B$-model is, but for $a$ and $a'$, both $A$-models, we
might not know if one or the other is closer to $K$, or if they have the same distance.



The representation results for $\mathcal{A}$-ranked structures were shown already in Section 5.2 (page 87), so we can turn immediately
to the following:

7.3.5 Formal results and representation for hierarchical conditionals 

We look here at the following problem: 
Given 

(1.1) a finite, ordered partition $\mathcal{A}$ of $A$, $\mathcal{A} = \langle \{A_i : i \in I\}, < \rangle$,

(1.2) a normality relation $\prec$, which is an $\mathcal{A}$-ranking, defining a choice function $\mu$ on subsets of $A$ (so, obviously, $A < A'$
iff $\mu(A \cup A') \cap A' = \emptyset$),

(1.3) a subset $B \subseteq A$, and we set $\mathcal{C} := (\mathcal{A}, B)$ (thus, the $B_i$ are just $A_i \cap B$, this way of writing saves a little notation),

(2.1) a set of models $M$,

(2.2) an accessibility relation $R$ on $M$, with some finite upper bound on $R$-chains,

(2.3) an unknown extension of $R$ to pairs $(m, a)$, $m \in M$, $a \in A$,

(3.1) a notion of validity $m \models \mathcal{C}$, for $m \in M$, defined by $m \models \mathcal{C}$ iff $\{a \in A : \exists A \in \mathcal{A}\,(a \in \mu(A),\ a \in R(m),$ and

$\neg\exists a'\,(\exists A' \in \mathcal{A}\,(a' \in \mu(A'),\ a' \in R(m),\ a' \prec a)))\} \subseteq B$,

(3.2) a subset $M'$ of $M$,

give a criterion which decides whether it is possible to construct the extension of $R$ to pairs $(m, a)$ s.t. $\forall m \in M\,(m \in M'
\Leftrightarrow m \models \mathcal{C})$.

We first show some elementary facts on the situation, and give the criterion in Proposition 7.3.4 (page 151), together with
the proof that it does what is wanted.
Reachability for a transitive relation is characterized by
$y \in R(x) \rightarrow R(y) \subseteq R(x)$

Proof

Define directly $xRz$ iff $z \in R(x)$. This does it. □
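
The characterization can be checked mechanically on small finite relations. The sketch below is an illustration only; the helper names `transitive_closure`, `reach` and `is_nested` are hypothetical, and a relation is represented as a set of pairs.

```python
from itertools import product


def transitive_closure(pairs):
    """Smallest transitive relation containing the given pairs."""
    closure = set(pairs)
    changed = True
    while changed:
        changed = False
        # Iterate over a snapshot so the set can grow inside the loop.
        for (x, y), (y2, z) in product(list(closure), repeat=2):
            if y == y2 and (x, z) not in closure:
                closure.add((x, z))
                changed = True
    return closure


def reach(rel, x):
    """R(x): the points reachable from x in one step of rel."""
    return {z for (y, z) in rel if y == x}


def is_nested(rel, points):
    """Check y in R(x) -> R(y) subset of R(x) for all x in points."""
    return all(reach(rel, y) <= reach(rel, x)
               for x in points for y in reach(rel, x))
```

On a transitive relation the nesting condition holds at every point; on the non-transitive relation `{(1, 2), (2, 3)}` it fails, since 3 is reachable from 2 but not from 1.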



Let now $S$ be a set with an accessibility relation $R'$, generated by transitive closure from the intransitive subrelation $R$.
All modal notation will be relative to this $R$.

Let $A = M(\alpha)$, $A_i = M(\alpha_i)$, the latter is justified by Fact 5.2.11.

Definition 7.3.5 

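(1) $\mu(\mathcal{A}) := \bigcup\{\mu(A_i) : i \in I\}$
(warning: this is NOT $\mu(A)$)

(2) $\mathcal{A}_m := R(m) \cap A$,

(3) $\mu(\mathcal{A}_m) := \bigcup\{\mu(A_i) \cap R(m) : i \in I\}$
(3a) $\nu(\mathcal{A}_m) := \mu(\mu(\mathcal{A}_m))$

(thus $\nu(\mathcal{A}_m) = \{a \in A : \exists A \in \mathcal{A}\,(a \in \mu(A),\ a \in R(m),$ and

$\neg\exists a'\,(\exists A' \in \mathcal{A}\,(a' \in \mu(A'),\ a' \in R(m),\ a' \prec a)))\}$).

(4) $m \models \mathcal{C} :\leftrightarrow \nu(\mathcal{A}_m) \subseteq B$.
See Diagram 7.3.3 (page 150).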




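We have the following Fact for $m \models \mathcal{C}$:
Fact 7.3.2

Let $m, m' \in M$, $A, A' \in \mathcal{A}$.

(1) $m \models \Box\neg\alpha$, $mRm'$ $\Rightarrow$ $m' \models \Box\neg\alpha$

(2) $mRm'$, $\nu(\mathcal{A}_m) \cap A \neq \emptyset$, $\nu(\mathcal{A}_{m'}) \cap A' \neq \emptyset$ $\Rightarrow$ $A \leq A'$

(3) $mRm'$, $\nu(\mathcal{A}_m) \cap A \neq \emptyset$, $\nu(\mathcal{A}_{m'}) \cap A' \neq \emptyset$, $m \models \mathcal{C}$, $m' \not\models \mathcal{C}$ $\Rightarrow$ $A < A'$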

Proof 

Trivial. □ 
Fact 7.3.3

We can conclude from above properties that there are no arbitrarily long $R$-chains of models $m$, changing from $m \models \mathcal{C}$ to
$m \not\models \mathcal{C}$ and back.

Proof

Trivial: By Fact 7.3.2 (page 150), (3), any change from $\models \mathcal{C}$ to $\not\models \mathcal{C}$ results in a strict increase in rank. □

We solve now the representation task described at the beginning of Section 7.3.5 (page 149); all we need are the properties
shown in Fact 7.3.2 (page 150).

(Note that constructing $R$ between the different $m$, $m'$ is trivial: we could just choose the empty relation.)
Proposition 7.3.4

If the properties of Fact 7.3.2 (page 150) hold, we can extend $R$ to solve the representation problem described at the
beginning of this Section 7.3.5 (page 149).

Proof 

By induction on $R$. This is possible, as the depth of $R$ on $M$ was assumed to be finite.
Construction 7.3.1

We now choose elements where possible; which ones exactly are chosen does not matter.
$X_i := \{b_i, c_i\}$ iff $\mu(A_i) \cap B \neq \emptyset$ and $\mu(A_i) - B \neq \emptyset$, $b_i \in \mu(A_i) \cap B$, $c_i \in \mu(A_i) - B$,
$X_i := \{c_i\}$ iff $\mu(A_i) \cap B = \emptyset$ and $\mu(A_i) - B \neq \emptyset$, $c_i \in \mu(A_i) - B$,
$X_i := \{b_i\}$ iff $\mu(A_i) \cap B \neq \emptyset$ and $\mu(A_i) - B = \emptyset$, $b_i \in \mu(A_i) \cap B$,
$X_i := \emptyset$ iff $\mu(A_i) = \emptyset$.
Case 1:

Let $m$ be $R$-minimal and $m \models \mathcal{C}$. Let $i_0$ be the first $i$ s.t. $b_i \in X_i$, make $\gamma(m) := i_0$, and make $R(m) := \{b_{i_0}\} \cup \bigcup\{X_i :
i > i_0\}$. This makes $\mathcal{C}$ hold. (This leaves us as many possibilities open as possible - remember we have to decrease the set
of reachable elements now.)

Case 2:

Let $m$ be $R$-minimal and $m \not\models \mathcal{C}$. Let $i_0$ be the first $i$ s.t. $c_i \in X_i$, make $\gamma(m) := i_0$, and make $R(m) := \bigcup\{X_i : i \geq i_0\}$.
This makes $\mathcal{C}$ false.

Let all $R$-predecessors of $m$ be determined, and $i := \max\{\gamma(m') : m'Rm\}$.

Case 3: $m \models \mathcal{C}$. Let $j$ be the smallest $i' \geq i$ with $\mu(A_{i'}) \cap B \neq \emptyset$. Let $R(m) := \{b_j\} \cup \bigcup\{X_k : k > j\}$, and $\gamma(m) := j$.
Case 4: $m \not\models \mathcal{C}$.

Case 4.1: For all $m'Rm$ with $i = \gamma(m')$, $m' \not\models \mathcal{C}$.
Take one such $m'$ and set $R(m) := R(m')$, $\gamma(m) := i$.
Case 4.2: There is $m'Rm$ with $i = \gamma(m')$, $m' \models \mathcal{C}$.

Let $j$ be the smallest $i' > i$ with $\mu(A_{i'}) - B \neq \emptyset$. Let $R(m) := \bigcup\{X_k : k \geq j\}$. (Remark: To go from $\models$ to $\not\models$, we have to
go higher in the hierarchy.)

Obviously, validity is done as it should be. It remains to show that the sets of reachable elements decrease with $R$.
Fact 7.3.5

In above construction, if $mRm'$, then $R(m') \subseteq R(m)$.
Proof

By induction, considering $R$. □ (Fact 7.3.5 (page 151) and Proposition 7.3.4 (page 151))

We consider an example for illustration. 
Example 7.3.1 

Let $a_1 R a_2 R c R c_1$, $b_1 R b_2 R b_3 R c R d_1 R d_2$.

Let $\mathcal{C} = (A_1 > B_1, \ldots, A_n > B_n)$ with the $C_i$ consistency with $\mu(A_i)$.

Let $\mu(A_2) \cap B_2 = \emptyset$, $\mu(A_3) \subseteq B_3$, and for the other $i$ hold neither of these two.

Let $a_1, a_2, b_2, c_1, d_2 \models \mathcal{C}$, the others $\not\models \mathcal{C}$.

Let $\mu(A_1) = \{a_{1,1}, a_{1,2}\}$, with $a_{1,1} \in B_1$, $a_{1,2} \notin B_1$,
... and the others like $\mu(A_1)$. Let $\mu\mathcal{A} := \bigcup\{\mu(A_i) : i \leq n\}$.

We have to start at $a_1$ and $b_1$, and make $R(x)$ progressively smaller.

Let $R(a_1) := \mu\mathcal{A} - \{a_{1,2}\}$, so $a_1 \models \mathcal{C}$. Let $R(a_2) = R(a_1)$, so again $a_2 \models \mathcal{C}$.

Let $R(b_1) := \mu\mathcal{A} - \{a_{1,1}\}$, so $b_1 \not\models \mathcal{C}$. We now have to take $a_{1,2}$ away, but $a_{2,1}$ too to be able to change. So let
$R(b_2) := R(b_1) - \{a_{1,2}, a_{2,1}\}$, so we begin at $\mu(A_3)$, which is a (positive) singleton. Then let $R(b_3) := R(b_2) - \{a_{3,1}\}$.

We can choose $R(c) := R(b_3)$, as $R(b_3) \subseteq R(a_2)$.

Let $R(c_1) := R(c) - \{a_{4,2}\}$ to make $\mathcal{C}$ hold again. Let $R(d_1) := R(c)$, and $R(d_2) := R(c_1)$.
□ 
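
The example can be replayed in a few lines of Python. This is a sketch under assumptions that the text does not fully spell out: four layers are used ($n = 4$), $\mu(A_2)$ and $\mu(A_3)$ are taken to be the singletons $\{a_{2,1}\}$ and $\{a_{3,1}\}$, and the identifiers (`MU`, `GOOD`, `holds`, `R`, `CHAINS`) are hypothetical. Element $a_{i,j}$ is written `"aij"`.

```python
# mu(A_i) for the four layers assumed in this sketch.
MU = {
    1: {"a11", "a12"},
    2: {"a21"},          # mu(A_2) ∩ B_2 is empty
    3: {"a31"},          # mu(A_3) is a subset of B_3
    4: {"a41", "a42"},
}
GOOD = {"a11", "a31", "a41"}  # the minimal elements lying in B


def holds(reachable):
    """m |= C: the lowest visible layer lies inside B."""
    for i in sorted(MU):
        visible = MU[i] & reachable
        if visible:
            return visible <= GOOD
    return True


ALL = set().union(*MU.values())
R = {}
R["a1"] = ALL - {"a12"}
R["a2"] = R["a1"]
R["b1"] = ALL - {"a11"}
R["b2"] = R["b1"] - {"a12", "a21"}
R["b3"] = R["b2"] - {"a31"}
R["c"] = R["b3"]
R["c1"] = R["c"] - {"a42"}
R["d1"] = R["c"]
R["d2"] = R["c1"]

# The two R-chains of the example.
CHAINS = [["a1", "a2", "c", "c1"], ["b1", "b2", "b3", "c", "d1", "d2"]]
```

Evaluating `holds(R[m])` for each point gives exactly the prescribed pattern (true at $a_1, a_2, b_2, c_1, d_2$, false elsewhere), and the reachable sets shrink along both chains, as Fact 7.3.5 requires.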



Chapter 8 

Theory update and theory revision 



8.1 Update 
8.1.1 Introduction 



We will treat here problems due to lack of information, i.e. we can "see" some dimensions, but not all. 
8.1.2 Hidden dimensions 

8.1.2.1 Introduction 

We look here at situations where only one dimension is visible in the results, and the other ones stay hidden. This is e.g. 
the case when we can observe only the outcome of developments, but not the developments themselves. 

It was the authors' intention to treat here the general infinite case, and then show that the problems treated in [BLS99] and
[LMS01] (the not necessarily symmetric case there) are special cases thereof. Unfortunately, we failed in the attempt to
solve the general infinite case; it seems that one needs new and quite different methods to solve it, so we will just describe
modestly what we see that can be done, what the problems seem to be, and conclude with a very short remark on the
situation described in [BLS99].

8.1.2.2 Situation, definitions, and basic results 

In several situations, we can observe directly only one dimension of a problem. In a classical ranked structure, we "see" 
everything about an optimal model. It is there, and fully described. But look at a ranking of developments, where we can 
observe only the outcome. The earlier dimensions remain hidden, and when we see the outcome of the "best" developments, 
we do not directly see the threads which led there. A similar case is theory revision based on not necessarily symmetric
distances, where we cannot "look back" from the result to see the closest elements of the former theory (see [LMS01] for
details).

The non-definable case and the case of hidden dimensions are different aspects of a common problem: In the case of 
non-definability, any not too small subset might generate what we see, in the case of hidden dimensions, any thread ending 
in an optimal outcome might be optimal. 



8.1.2.2.1 The situation, more formally 

The universe is a finite or even infinite product $\Pi U_i$, $i \in I$. We will see that the finite case is already sufficiently nasty, so
we will consider only the finite situation. If $X \subseteq \Pi U_i$, then possible results will be projections on some fixed coordinate,
say $j$, of the best $\sigma \in X$, where best is determined by some ranking $\prec$ on $\Pi U_i$: $\pi_j(\mu(X))$.

As input, we will usually not have arbitrary $X \subseteq \Pi U_i$, but again some product $X := \Pi U'_i$, with $U'_i \subseteq U_i$. Here is the main
problem: we cannot use as input arbitrary sets, but only products. We will see that this complication will hide almost all
information in sufficiently nasty situations.

We will make now some reasonable assumptions: 

First, without loss of generality, we will always take the last dimension as outcome. Obviously, this does not change the 
general picture. 

Second, the difficult situations are those where (some of) the $U_i$ are infinite. We will take the infinite propositional case
with theories as input as motivation, and assume that for each $U_i$, we can choose any finite $U'_i$ as input, and the possible
$U'_i$ are closed under intersections and finite unions.

Of course, $\Pi U'_i \cup \Pi U''_i$ need not be a product $\Pi V_i$ - here lies the main problem, the domain is not closed under finite unions.
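
A two-dimensional toy case makes the closure failure visible. In this sketch (the helper name `is_product` and the sets `P1` to `P4` are hypothetical), a set of tuples counts as a product iff it equals the product of its own coordinate projections.

```python
from itertools import product


def is_product(X):
    """True iff X equals the product of its coordinate projections."""
    X = set(X)
    if not X:
        return True
    components = [set(column) for column in zip(*X)]
    return X == set(product(*components))


P1 = set(product({0}, {0}))        # {(0, 0)}
P2 = set(product({1}, {1}))        # {(1, 1)}
P3 = set(product({0, 1}, {0}))     # {(0, 0), (1, 0)}
P4 = set(product({0}, {0, 1}))     # {(0, 0), (0, 1)}
```

The union `P1 | P2` misses `(0, 1)` and `(1, 0)`, so it is not a product, whereas the intersection `P3 & P4` is again a product.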

We will see that the case presents serious difficulties, which the authors do not know how to solve. The basic problem
is that we do not have enough information to construct an order.

Notation 8.1.1 

$(.)!$ will be the projection on the fixed (last) coordinate, so for $X \subseteq \Pi U_i$, $i \leq n$, $X! := \{\sigma(n) : \sigma \in X\}$, and analogously,
$\sigma! := \sigma(n)$ for $\sigma \in \Pi U_i$.
For any set of sequences $X$, $[X]$ will be the smallest product which contains $X$.

Likewise, for $\sigma, \sigma'$, $[\sigma, \sigma']$ will be the product (which will always be defined as we have singletons and finite unions in the
components) $\Pi\{\sigma(i), \sigma'(i)\}$ ($i \in I$). Analogously, $(\sigma, \sigma') := [\sigma, \sigma'] - \{\sigma, \sigma'\}$. (The interval notation is intended.)

If $\sigma \in X$, a $\sigma$-cover of $X$ will be a set $\{X_k : k \in K\}$ s.t. $\sigma \notin X_k$ for all $k$, and $\bigcup\{X_k : k \in K\} \cup \{\sigma\} = X$.

A (finite) sequence $\sigma = \sigma_1, \ldots, \sigma_n$ will also be written $(\sigma_1, \ldots, \sigma_n)$.

The Hamming distance is very important in our situation. We can define the Hamming distance between two (finite)
sequences as the number of arguments where they disagree, or as the set of those arguments. In our context, it does not
matter which definition we choose. "$H$-closest to $\sigma$" means thus "closest to $\sigma$, measured by (one of) the Hamming
distance(s)".
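
The notation translates directly into code for finite sequences. The following is a minimal sketch with hypothetical function names (`hamming_set`, `hamming_count`, `hull`, `open_interval`); sequences are Python tuples, and `hull` computes the smallest product $[X]$ containing a given non-empty set of sequences.

```python
from itertools import product


def hamming_set(s, t):
    """Set of coordinates where the sequences s and t disagree."""
    return {i for i, (a, b) in enumerate(zip(s, t)) if a != b}


def hamming_count(s, t):
    """Number of coordinates where s and t disagree."""
    return len(hamming_set(s, t))


def hull(sequences):
    """[X]: the smallest product containing all given sequences."""
    components = [set(column) for column in zip(*sequences)]
    return set(product(*components))


def open_interval(s, t):
    """(s, t) := [s, t] - {s, t}."""
    return hull([s, t]) - {s, t}
```

For instance, `hull([(0, 0), (1, 1)])` is the full square, and `open_interval((0, 0), (1, 1))` consists of the two mixed pairs.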

8.1.2.2.2 Direct and indirect observation 

If $\sigma! = \sigma'!$, we cannot compare them directly, as we always see the same projection. If $\sigma! \neq \sigma'!$, we can compare them, if
e.g. $X := \{\sigma, \sigma'\}$ is in the domain of $\mu$ (which usually is not the case, but only if they differ only in the last coordinate):
$\mu(X)!$ can be $\{\sigma!\}$, $\{\sigma'!\}$, $\{\sigma!, \sigma'!\}$, so $\sigma$ is better, $\sigma'$ is better, or they are equal.

Thus, to compare $\sigma$ and $\sigma'$ if $\sigma! = \sigma'!$, we have to take a detour, via some $\tau$ with $\tau! \neq \sigma!$. This is illustrated by the following
example:

Example 8.1.1 

Let a set $A$ be given, $B := \{b, b'\}$, and $\mu(A \times B)! = \{b\}$. Of course, without any further information, we have no idea if for
all $a \in A$ $(a, b)$ is optimal, if only for some, etc.

Consider now the following situation: Let $a, a' \in A$ be given, $A' := \{a, a'\}$, and $C := \{b, c, c'\}$ - where $c, c'$ need not be in
$B$. Let $A' \times C$, $\{a\} \times C$, $\{a'\} \times C$ be defined and observable, and

(1) $\mu(A' \times C)! = \{b, c\}$,

(2) $\mu(\{a\} \times C)! = \{b, c\}$,

(3) $\mu(\{a'\} \times C)! = \{b, c'\}$.

Then by (3) $(a', b) \approx (a', c') \prec (a', c)$, by (2) $(a, b) \approx (a, c) \prec (a, c')$, by (1) $(a', c')$ cannot be optimal, so neither is $(a', b)$,
thus $(a, b)$ must be better than $(a', b)$, as one of $(a, b)$ and $(a', b)$ must be optimal.

Thus, in an indirect way, using elements $c$ and $c'$ not in $B$, we may be able to find out which pairs in $A \times B$ are really
optimal. Obviously, arbitrarily long chains might be involved, and such chains may also lead to contradictions.

□ 
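
The indirect argument can be confirmed by brute force. The sketch below is an illustration only: it models a ranking as an assignment of integer ranks to the six pairs of $A' \times C$ (smaller is better), enumerates all such assignments, keeps those that reproduce observations (1) to (3), and checks that $(a, b)$ is strictly better than $(a', b)$ in each. The names `observed` and `consistent_rankings` are hypothetical.

```python
from itertools import product

C = ["b", "c", "c'"]
PAIRS = [(x, y) for x in ["a", "a'"] for y in C]


def observed(rank, first_coords):
    """Endpoints of the rank-minimal pairs of first_coords x C."""
    pairs = [(x, y) for x in first_coords for y in C]
    best = min(rank[p] for p in pairs)
    return {y for (x, y) in pairs if rank[(x, y)] == best}


def consistent_rankings():
    """All rank assignments that reproduce observations (1)-(3)."""
    # Six rank values suffice to express every ranked order on six pairs.
    for values in product(range(len(PAIRS)), repeat=len(PAIRS)):
        rank = dict(zip(PAIRS, values))
        if (observed(rank, ["a", "a'"]) == {"b", "c"}
                and observed(rank, ["a"]) == {"b", "c"}
                and observed(rank, ["a'"]) == {"b", "c'"}):
            yield rank


rankings = list(consistent_rankings())
```

The list `rankings` is non-empty, and every ranking in it places `("a", "b")` strictly below `("a'", "b")`, which is the conclusion drawn in the example.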



So it seems difficult to find a general way to find the best pairs - and much will depend on the domain, how much we can
compare, etc. It might also well be that we cannot find out - so we will branch into all possibilities, or choose arbitrarily
one - losing ignorance, of course.

Definition 8.1.1 

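Let $\{X_k : k \in K\}$ be a $\sigma$-cover of $X$.

Then we can define $\prec$ and $\preceq$ in the following cases:

(1) $\forall k\,(\mu X_k! \not\subseteq \mu X!)$ $\rightarrow$ $\sigma \prec \sigma'$ for all $\sigma' \in X$, $\sigma' \neq \sigma$,

(2) $\mu X! \not\subseteq \bigcup \mu X_k!$ $\rightarrow$ $\sigma \prec \sigma'$ for all $\sigma' \in X$ s.t. $\sigma'! \notin \mu(X)!$, and $\sigma \preceq \sigma'$ for all $\sigma' \in X$ s.t. $\sigma'! \in \mu X!$.

Explanation of the latter: If $\sigma'! \in \mu X!$, there is some $\sigma''$ s.t. $\sigma''$ is one of the best, and $\sigma''! = \sigma'!$ - but we are not sure which
one. So we really need $\preceq$ here.

Of course, we also know that for each $x \in \mu X!$ there is some $\sigma$ s.t. $\sigma! = x$ and $\sigma \preceq \sigma'$ for all $\sigma' \in X$, etc., but we do not
know which $\sigma$.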

We describe now the two main problems. 

We want to construct a representing relation. In particular, 

(a) if $\tau \in X$, $\tau! \notin \mu X!$, we will need some $\sigma \in X$, $\sigma \prec \tau$,
and

(b) if $\tau! \in \mu X!$ and $x' \neq \tau!$, $x' \in \mu X!$, we will need some $\sigma \in X$, $\sigma \preceq \tau$.

Consider now some such candidate, and the smallest set containing them, $X := [\tau, \sigma]$.

Problem (1): $\sigma \prec \tau$ might well be the case, but there is $\tau' \in (\tau, \sigma)$, $\tau'! = \tau!$, and $\tau' \prec \sigma$ - so we will not see this, as
$\mu X! = \{\tau'!\}$.

Problem (2): $\sigma \prec \tau$ might well be the case, but there is $\sigma' \in (\tau, \sigma)$, $\sigma'! = \sigma!$, $\sigma' \prec \tau$ - so we will not see this, as already
$\mu[\sigma', \tau]! = \sigma!$.

Problem (2) can be overcome for our purposes, by choosing a $H$-closest $\sigma$ s.t. $\sigma \prec \tau$, and if (1) is not a problem, we can
see now that $\sigma \prec \tau$: for all suitable $\rho$ we have $\mu[\rho, \tau]! = \tau!$, and only $\mu[\tau, \sigma]! = \sigma!$.
... elements of $[\sigma', \tau]$, so we have to take the $H$-closest one to avoid Problem (2), work with this one, and we will see that
$\sigma' \prec \tau$.

Both cases illustrate the importance of choosing H— closest elements, which is, of course, possible by finiteness of the index 
set. 

Problem (1) can, however, be unsolvable if we take the limit approach, as we may be forced to do (see discussion in Section
8.1.2 (page 153)), as we will not necessarily have any more such $\sigma$ which minimize all $\tau$ - see Example 8.1.4 (page 156),
Case (4).

But in the other, i.e. non-problematic, cases, this is the approach to take:

Choose for $\tau$ some $\sigma$ s.t. $\sigma \preceq \rho$ for all $\rho$ - if this exists - then consider $[\tau, \sigma]$, choose in $[\tau, \sigma]$ the $H$-closest (measured from
$\tau$) $\rho$ s.t. $\rho$ is smaller or equal to all $\rho' \in [\rho, \tau]$. This has to exist, as $\sigma$ is a candidate, and we work with a finite product, so
the $H$-closest such has to exist.

We will then use a Cover Property: 

8.1.2.2.3 Property 6.3.1 

Let t G A, t! ^ pA!, Yj x := {cr G X : cr! = x} for some fixed x G pA!. Let H x C (J for i G /, and X, D E$ U {cr}, then: 
There is z G / s.t. r! ^ pJQ!. 

(This will also hold in the limit case.) 

This allows to find witnesses, if possible: 

Let $\tau \in X$, $\tau! \notin \mu X!$, then $\tau$ is minimized by one $\sigma$ with $\sigma! \in \mu X!$. Take such $\sigma$ which is $H$-closest to $\tau$; this exists by the Cover
Property. Then we see that $\tau! \notin \mu[\sigma, \tau]!$, and that $\sigma$ does it, so we define $\sigma \prec \tau$.

We will also have something similar for 2 elements $\sigma, \sigma' \in X$, $\sigma! \neq \sigma'!$, $\sigma!, \sigma'! \in \mu X!$: We will seek $\sigma, \sigma'$ which have minimal
$H$-distance among all $\tau, \tau'$ with the same endpoints s.t. $\sigma \preceq \rho$, $\sigma' \preceq \rho$ for all $\rho \in [\sigma, \sigma']$. This must exist in the minimal
variant, but, warning, we are not sure that they are really the minimal ones (by the order) - see Example 8.1.4 (page 156),
(1).

8.1.2.2.4 Small sets and easy properties 

As a first guess, one might think that small sets always suffice to define the relation. This is not true. First, if $\sigma! = \sigma'!$,
then $[\sigma, \sigma']$ will give us no information.

The following (also negative) example is more instructive:
Example 8.1.2

Consider $X := \{0,1,2\} \times \{0,1\}$, with the ordering: $(0,0) \prec (2,1) \prec (1,0) \prec (0,1) \approx (1,1) \approx (2,0)$. We want to find out
that $(0,0) \prec (1,1)$ using the results $\mu Y!$.

Consider $X' := \{0,1\} \times \{0,1\}$, the smallest set containing both. Any $(0,0)$-cover of $X'$ has to contain some $X''$ with
$(1,0) \in X''$, but then $\mu X''! = \{0\}$, so we cannot see that $(0,0) \prec (1,1)$.

But consider the $(0,0)$-cover $\{X', X''\}$ of $X$ with $X' := \{1,2\} \times \{0,1\}$, $X'' := \{(0,1)\}$, then $\mu X'! = \mu X''! = \{1\}$, but

$\mu X! = \{0\}$, so we see that $(0,0) \prec (1,1)$.

□ 
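
The observations used in the example can be recomputed from the ordering. This sketch assumes the ordering is read as $(0,0) \prec (2,1) \prec (1,0) \prec (0,1) \approx (1,1) \approx (2,0)$ and encodes it as integer ranks; the names `RANK`, `mu_proj`, `prod` and `cover_shows_strict` are hypothetical. `cover_shows_strict` tests the situation used in the text: every member of a $\sigma$-cover has an observed endpoint that is not among the observed endpoints of the whole set.

```python
from itertools import product

# Integer ranks encoding the ordering as read above (smaller is better).
RANK = {(0, 0): 0, (2, 1): 1, (1, 0): 2, (0, 1): 3, (1, 1): 3, (2, 0): 3}


def mu_proj(X):
    """Last coordinates of the rank-minimal elements of X."""
    best = min(RANK[s] for s in X)
    return {s[-1] for s in X if RANK[s] == best}


def prod(*components):
    """The product of the given component sets, as a set of tuples."""
    return set(product(*components))


def cover_shows_strict(sigma, X, cover):
    """True iff `cover` is a sigma-cover of X and each member has an
    observed endpoint outside the observed endpoints of X."""
    is_cover = (all(sigma not in Xk for Xk in cover)
                and set().union(*cover) | {sigma} == X)
    return is_cover and all(not (mu_proj(Xk) <= mu_proj(X)) for Xk in cover)


X = prod({0, 1, 2}, {0, 1})
SMALL = prod({0, 1}, {0, 1})
small_cover = [prod({1}, {0, 1}), {(0, 1)}]
big_cover = [prod({1, 2}, {0, 1}), {(0, 1)}]
```

On the small set the cover member containing $(1,0)$ shows endpoint 0, the same as the whole set, so nothing is learned; on the full set both cover members show endpoint 1 while the whole set shows 0, which exposes $(0,0)$ as strictly best.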



(But we can obtain the same result through a chain of small sets: $(0,0) \prec (2,1)$: look at the $(0,0)$-cover $\{\{2\} \times
\{0,1\}, \{(0,1)\}\}$ of $\{0,2\} \times \{0,1\}$; $(2,1) \prec (2,0)$ is obvious; $(2,0) \preceq (1,1)$: look at the $(2,0)$-cover $\{\{1\} \times \{0,1\}, \{(2,1)\}\}$
of $\{1,2\} \times \{0,1\}$, we see that $\mu(\{1\} \times \{0,1\})! = \mu\{(2,1)\}! = \{1\}$, and $\mu(\{1,2\} \times \{0,1\})! = \{0,1\}$.)

Remark 8.1.1 

The following are immediate: 

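(1) $\bigcap\{\Pi\{X_i^j : i \in I\} : j \in J\} = \Pi\{\bigcap\{X_i^j : j \in J\} : i \in I\}$

(2) $\mu X! \subseteq X!$

(3) $X = \bigcup X_k \rightarrow \mu X! \subseteq \bigcup(\mu X_k!)$

(4) $X \subseteq X' \rightarrow X! - \mu X! \subseteq X'! - \mu X'!$ is in general wrong, though $X \subseteq X' \rightarrow X - \mu X \subseteq X' - \mu X'$ is correct: There
might be a new minimal thread in $X'$, which happens to have the same endpoint as discarded threads in $X$.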

8.1.2.3 A first serious problem 



We describe now the first serious problem, and indicate how to work around it. So, we think this can still be avoided in a 
reasonable way, it is the next one, where we meet essentially the same difficulty, and where we see no solution or detour. 



We will complete the ranked relation (or, better, what we see of it) as usual, i.e. close under transitivity, it has to be free
... have to take care that we do not have infinite descending chains of threads $\sigma$ with the same endpoint, as the endpoint will
then not appear anymore, by definition of minimality. More precisely:

Example 8.1.3 

Let $\sigma! = \tau!$ and $\sigma'! = \tau'!$, and $\sigma' \prec \sigma$, $\tau \prec \tau'$, then choosing $\sigma \preceq \tau$ will result in $\sigma' \prec \tau'$. Likewise, $\sigma'' \preceq \tau''$ with $\sigma''! = \tau''!$
may lead to $\rho' \prec \sigma'$ (with $\sigma'! = \rho'!$), etc., and to an infinite descending chain. This problem seems difficult to solve, and
the authors do not know if there is always a solution, probably not, as we need just $\omega$ many pairs to create an infinite
descending chain, but the universe can be arbitrarily big.

8.1.2.3.1 Two ways to work around this problem: 

(1) We may neglect such infinite descending chains, and do as if they were not there. Thus, we will take a limit approach 
(see Section 1531 (page HOlj) ) locally, within the concerned a\. This might be justified intuitively by the fact that we are 
really interested only in one dimension, and do not really care about how we got there, so we neglect the other dimensions 
somewhat. 

(2) We may consider immediately the global limit approach. In this case, it will probably be the best strategy to consider 
formula defined model sets, in order to be able to go easily to the logical version - see Section I5~5l fpage |101[) 

For (1): 

We continue to write $\mu X!$ for the observed result, though this will not be a minimum any more.
We then have:

$\forall \sigma \in X\ \forall x \in \mu X!\ \exists \tau\,(\tau! = x \wedge \tau \preceq \sigma)$,

but we will not have any more:

$\forall x \in \mu X!\ \exists \tau \in X\,(\tau! = x \wedge \forall \sigma \in X.\ \tau \preceq \sigma)$

Note that in the limit case:

(1) $X \neq \emptyset \rightarrow \mu X! \neq \emptyset$

(2) If $x, x' \in \mu X!$, and $x$ is a limit (i.e. there is an infinite descending chain of $\sigma$'s with $\sigma! = x$), then so is $x'$.

(3) By finiteness of the Hamming distance, e.g. there will be for all $\sigma \in X$ and all $x \in \mu X!$ cofinally often some $H$-closest
(to $\sigma$) $\tau \in X$ s.t. $\tau! = x$ and $\tau \preceq \sigma$. (Still, this will not help us much, unfortunately.)

8.1.2.4 A second serious problem 

This problem can be seen in a (class of) unpleasant example(s) , it has to do with the fact that interesting elements may 
be hidden by other elements. 

Example 8.1.4 

We discuss various variants of this example, and first present the common parts. 

The domain U will be 2 × ω × 2. X ⊆ U etc. will be legal products. We define suitable orders, and will see that we can 
hardly get any information about the order. 

σ_i := (0, i, 0), τ_i := (1, i, 1). 

The key property is: 

If σ_i, τ_j ∈ X ⊆ U for some i, j ∈ ω, then σ_k ∈ X ↔ τ_k ∈ X. (Proof: Let e.g. σ_k = (0, k, 0) ∈ X, then, as τ_j = (1, j, 1) ∈ X, 
(1, k, 1) ∈ X.) 

In all variants, we make a top layer consisting of all (1, i, 0) and (0, i, 1), i ∈ ω. They will play a minor role. All other 
elements will be strictly below this top layer. 

We turn to the variants. 

(1) Let σ_i ∼ τ_i for all i, and σ_i ∼ τ_i ≺ σ_{i+1} ∼ τ_{i+1}, close under transitivity. Thus, there is a minimal pair, σ_0, τ_0, and we 
have an "ascending ladder" of the other σ_i, τ_i. But we cannot see which pair is at the bottom. 

Proof: Let X ⊆ U. If X contains only elements of the type (a, b, 0) or (a, b, 1), the result is trivial: μX! = {0}, etc. So 
suppose not. If X contains no σ_i and no τ_i, the result is trivial again, and gives no information about our question. The 
same is true if X contains only σ_i's or only τ_i's, but not both. If X contains some σ_i, and some τ_j, let k be the smallest 
such index i or j, then, by the above remark, σ_k and τ_k will be in X, so μX! = {0, 1} in any case, and this will not tell us 
anything. (More formally, consider in parallel the modified order, where the pair σ_1, τ_1 is the smallest one, instead of σ_0, 
τ_0, and order the others as before, then this different order will give the same result in all cases.) 

(2) Order the elements as in (1), only make a descending ladder, instead of an ascending one. Thus, μU! = ∅, but μX! in 
variant (1) and in variant (2) will not be different for any finite X. 

Proof: As above, the interesting case is when there is some σ_i and some τ_j in X, we now take the biggest such index k, 
then σ_k and τ_k will both be in X, and thus μX! = {0, 1}. Consequently, only infinite X allow us to distinguish variant (1) 
from variant (2). But they give no information how the relation is precisely defined, only that there are infinite descending 
chains. 

Thus, finite X do not allow us to distinguish between absence and presence of infinite descending chains. 
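
The indistinguishability of variants (1) and (2) on finite sets can be checked mechanically on a truncated domain. The following is only a sketch under explicit assumptions: the index set ω is cut down to {0, ..., N-1}, a legal product is taken to be a set A × B × C, the endpoint of a triple is taken to be its last coordinate, and all top layer elements are given one common rank. The function names are ours, not the authors'.

```python
from itertools import combinations, product

N = 4  # assumption: truncate omega to {0, ..., N-1}


def nonempty_subsets(items):
    """All nonempty subsets of a finite collection."""
    items = list(items)
    return [set(c) for r in range(1, len(items) + 1)
            for c in combinations(items, r)]


def rank_ascending(elem):
    """Variant (1): sigma_i, tau_i sit at level i, the top layer above them."""
    a, i, c = elem
    return (1, 0) if a != c else (0, i)


def rank_descending(elem):
    """Variant (2): the same ladder, turned upside down."""
    a, i, c = elem
    return (1, 0) if a != c else (0, -i)


def observed_endpoints(X, rank):
    """Endpoints (last coordinates) of the rank-minimal elements of X."""
    best = min(rank(e) for e in X)
    return {e[2] for e in X if rank(e) == best}


def legal_products():
    """All nonempty products A x B x C inside the truncated domain."""
    for A in nonempty_subsets([0, 1]):
        for B in nonempty_subsets(range(N)):
            for C in nonempty_subsets([0, 1]):
                yield set(product(A, B, C))


def variants_agree():
    """True iff both variants give the same observation on every product."""
    return all(
        observed_endpoints(X, rank_ascending) == observed_endpoints(X, rank_descending)
        for X in legal_products()
    )
```

On this truncated domain the two rankings produce the same observed endpoints for every product, which is the finite part of the claim; the difference between the variants only shows up for infinite X, which the sketch cannot represent.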

(3) Order the σ_i in an ascending chain, let σ_i ≺ τ_i, and no other comparisons between the σ_i and the τ_j. Thus, this is not 
a ranked relation. Again, the only interesting case is where some σ_i ∈ X, and some τ_j ∈ X. But by the above reasoning, we 
always have μX! = {0}. 

Thus, we cannot distinguish ranked from non-ranked relations. 

(4) Let now σ_i ≺ τ_i, and τ_{i+1} ≺ σ_i, close under transitivity. As we have an infinite descending chain, we take the limit 
approach. 

Let X be finite, and some σ_i = (0, i, 0) ∈ X, then μX! = {0}. Thus, we can never see that some specific τ_j minimizes some 
σ_k. 

Proof: All elements of the top layer are minimized by σ_i. If there is no τ_j ∈ X, we are done. Let τ_k = (1, k, 1) be the 
smallest τ_j ∈ X, then by the above σ_k = (0, k, 0) ∈ X, too, so 1 ∉ μX!. 

But we can see the converse: Consider X := [σ_i, τ_i], then μX! = {0}, as we just saw. The only other σ ∈ X with σ! = 0 is 
(1, i, 0), but we see that τ_i ≺ σ, as μ[τ_i, σ]! = {1} and [τ_i, σ] contains only 2 elements, so σ_i must minimize τ_i. 
Consequently, we have the information σ_i ≺ τ_i, but not any information about any τ_i ≺ σ_j. Yet, taking the limit approach 
applied to U, we see that there must be below each σ_j some τ_i, otherwise we would see only {0} as result. But we do not 
know how to choose the τ_i below the σ_j. 

□ 



8.1.2.5 Resume 



The problem is that we might not have enough information to construct the order. Taking any completion is not sure to 
work, as it might indirectly contradict existing limits or non-limits, which give only very scant information (there is or is 
not an infinite descending chain), but do not tell us anything about the order. 

For such situations, cardinalities of the sets involved might play a role. 

Unfortunately, the authors have no idea how to attack these problems. It seems quite certain that the existing techniques 
in the domain will not help us. 

So this might be a reason (or pretext?) to look at simplifications. 
It might be intuitively justifiable to impose a continuity condition: 

(Cont) If τ is between σ and σ' in the Hamming distance, then so is its ranking, i.e. ρ(σ) ≤ ρ(σ') → ρ(σ) ≤ ρ(τ) ≤ ρ(σ'), 
if ρ is the ranking function. 

This new condition seems to solve the above problems, and can perhaps be justified intuitively. But we have to be aware of 
the temptation to hide our ignorance and inability to solve problems behind flimsy intuitions. Yet, has someone ever come 
up with an intuitive justification of definability preservation? (The disastrous consequences of its absence were discussed 
in [Sch04].) 

On the other hand, this continuity or interpolation property might be useful in other circumstances, too, where we can 
suppose that small changes have small effects, see the second author's [Sch95-2] or Chapter 4 in [Sch97-2] for a broader 
and deeper discussion. 

This additional property should probably be further investigated. 

A word of warning: this condition is NOT compatible with distance based reasoning: The distance from 0 to 2 is the 
same as the distance from 1 to 3, but the set [(0,2), (1,3)] contains (1,2), with a strictly smaller distance. A first idea is 
then not to work on states, but on differences between states, considering the sum. But this does not work either. The 
sequences (0, 1), (1, 0) contain between them the smaller one (0, 0), and the bigger one (1, 1). Thus, the condition can be 
counterintuitive, and the authors do not know if there are sufficiently many natural scenarios where it holds. 

It seems, however, that we do not need the full condition, but the following would suffice: If σ is such that there is τ 
with σ! ≠ τ!, and τ ≺ σ, then we find τ' s.t. τ! = τ'! and τ' is smaller than all sequences in [σ, τ] (and perhaps a similar 
condition for ⪯). Then, we would see smaller elements in Example 8.1.4 (page 156). This is a limit condition, and is similar 
to the condition that there are cofinally many definable initial segments in the limit approach, see the discussion there, 
or in [Sch04]. 

8.1.2.6 A comment on former work 

We make here some very short remarks on our joint article with S. Berger and D. Lehmann, [BLS99]. 
The perhaps central definition of the article is Definition 3.10 in [BLS99]. 

First, we see that the relation R is generated only by sets where one includes the other. Thus, if the domain does not 
contain such sets, the relation will be void, and trivial examples show that completeness may collapse in such cases. 

More subtly, case 3. (a) is surprising, as Vz([-BJ . . . B^ l _ 1 .C] % [Ai . . . .A n ^i.C\) is expected, and not the (in result) much 
weaker condition given. 

The proof shows that the very definition of a patch allows to put the interesting sequence in all elements of the patch 
(attention: elements of the patch are not mutually disjoint, they are only disjoint from the starting set), so we can work 
with the intersection. The proof uses, however, that singletons are in the domain, and that the elements of arbitrary 
patches are in the domain. In particular, this will generally not be true in the infinite case, as patches work with set 
differences. 
8.2 Theory revision 

We begin with a very succinct introduction to AGM theory revision, and the subsequent results by the second author as 
published in [Sch04]. It is not supposed to be a self-contained introduction, but to help the reader recollect the situation. 

Section 8.2.2 describes very briefly parts of the work by Booth and co-authors, and then solves a representation 
problem in the infinite case left open by Booth et al. 

8.2.1 Introduction to theory revision 

Recall from the introduction that theory revision was invented in order to "fuse" together two separately consistent, but 
together inconsistent theories or formulas to a consistent result. By far the best known approach is that by Alchourrón, 
Gärdenfors, and Makinson, known as the AGM approach, see [AGM85]. They formulated "rationality postulates" for 
various variants of theory revision, which we now give in a very succinct form. Lehmann, Magidor, Schlechta, see [LMS01], 
gave a distance semantics for theory revision; this is further elaborated in [Sch04], and also presented here in very brief 
outline. 

Definition 8.2.1 

We present in parallel the logical and the semantic (or purely algebraic) side. For the latter, we work in some fixed universe 
U, and the intuition is U = M_ℒ, X = M(K), etc., so, e.g. A ∈ K becomes X ⊆ B, etc. 

(For reasons of readability, we omit most caveats about definability.) 

K_⊥ will denote the inconsistent theory. 

We consider two functions, − and *, taking a deductively closed theory and a formula as arguments, and returning a 
(deductively closed) theory on the logics side. The algebraic counterparts work on definable model sets. It is obvious that 
(K − 1), (K * 1), (K − 6), (K * 6) have vacuously true counterparts on the semantical side. Note that K (X) will never 
change, everything is relative to fixed K (X). K * φ is the result of revising K with φ. K − φ is the result of subtracting 
enough from K to be able to add ¬φ in a reasonable way, called contraction. 

Moreover, let ≤_K be a relation on the formulas relative to a deductively closed theory K on the formulas of ℒ, and ≤_X 
a relation on 𝒫(U) or a suitable subset of 𝒫(U) relative to fixed X. When the context is clear, we simply write ≤. ≤_K 
(≤_X) is called a relation of epistemic entrenchment for K (X). 

The following table presents the "rationality postulates" for contraction (−), revision (*) and epistemic entrenchment. In 
AGM tradition, K will be a deductively closed theory, φ, ψ formulas. Accordingly, X will be the set of models of a theory, 
A, B the model sets of formulas. 

In the further development, formulas φ etc. may sometimes also be full theories. As the transcription to this case is 
evident, we will not go into details. 



Logical side                                         Semantic side 

Contraction, K − φ 
(K − 1)  K − φ is deductively closed 
(K − 2)  K − φ ⊆ K                                   (X ⊖ 2)  X ⊆ X ⊖ A 
(K − 3)  φ ∉ K ⇒ K − φ = K                           (X ⊖ 3)  X ⊈ A ⇒ X ⊖ A = X 
(K − 4)  ⊬ φ ⇒ φ ∉ K − φ                             (X ⊖ 4)  A ≠ U ⇒ X ⊖ A ⊈ A 
(K − 5)  K ⊆ Cn((K − φ) ∪ {φ})                       (X ⊖ 5)  (X ⊖ A) ∩ A ⊆ X 
(K − 6)  ⊢ φ ↔ ψ ⇒ K − φ = K − ψ 
(K − 7)  (K − φ) ∩ (K − ψ) ⊆ K − (φ ∧ ψ)             (X ⊖ 7)  X ⊖ (A ∩ B) ⊆ (X ⊖ A) ∪ (X ⊖ B) 
(K − 8)  φ ∉ K − (φ ∧ ψ) ⇒ K − (φ ∧ ψ) ⊆ K − φ       (X ⊖ 8)  X ⊖ (A ∩ B) ⊈ A ⇒ X ⊖ A ⊆ X ⊖ (A ∩ B) 

Revision, K * φ 
(K * 1)  K * φ is deductively closed 
(K * 2)  φ ∈ K * φ                                   (X | 2)  X | A ⊆ A 
(K * 3)  K * φ ⊆ Cn(K ∪ {φ})                         (X | 3)  X ∩ A ⊆ X | A 
(K * 4)  ¬φ ∉ K ⇒ Cn(K ∪ {φ}) ⊆ K * φ                (X | 4)  X ∩ A ≠ ∅ ⇒ X | A ⊆ X ∩ A 
(K * 5)  K * φ = K_⊥ ⇒ ⊢ ¬φ                          (X | 5)  X | A = ∅ ⇒ A = ∅ 
(K * 6)  ⊢ φ ↔ ψ ⇒ K * φ = K * ψ 
(K * 7)  K * (φ ∧ ψ) ⊆ Cn((K * φ) ∪ {ψ})             (X | 7)  (X | A) ∩ B ⊆ X | (A ∩ B) 
(K * 8)  ¬ψ ∉ K * φ ⇒                                (X | 8)  (X | A) ∩ B ≠ ∅ ⇒ 
         Cn((K * φ) ∪ {ψ}) ⊆ K * (φ ∧ ψ)                      X | (A ∩ B) ⊆ (X | A) ∩ B 

Epistemic entrenchment 
(EE1)  ≤_K is transitive                             (EE1)  ≤_X is transitive 
(EE2)  φ ⊢ ψ ⇒ φ ≤_K ψ                               (EE2)  A ⊆ B ⇒ A ≤_X B 
(EE3)  ∀φ, ψ (φ ≤_K φ ∧ ψ or ψ ≤_K φ ∧ ψ)            (EE3)  ∀A, B (A ≤_X A ∩ B or B ≤_X A ∩ B) 
(EE4)  K ≠ K_⊥ ⇒ (φ ∉ K iff ∀ψ. φ ≤_K ψ)             (EE4)  X ≠ ∅ ⇒ (X ⊈ A iff ∀B. A ≤_X B) 
(EE5)  ∀ψ. ψ ≤_K φ ⇒ ⊢ φ                             (EE5)  ∀B. B ≤_X A ⇒ A = U 

Here Cn(.) denotes classical deductive closure. 

(1) Note that (X | 7) and (X | 8) express a central condition for ranked structures, see Section 3.10: If we write f_X(.) for 
X | ., we then have: f_X(A) ∩ B ≠ ∅ ⇒ f_X(A ∩ B) = f_X(A) ∩ B. 

(2) It is trivial to see that AGM revision cannot be defined by an individual distance (see Definition 2.3.5 below): Suppose 
X | Y := {y ∈ Y : ∃x_y ∈ X (∀y' ∈ Y. d(x_y, y) ≤ d(x_y, y'))}. Consider a, b, c. {a, b} | {b, c} = {b} by (X | 3) and (X | 4), so 
d(a, b) < d(a, c). But on the other hand {a, c} | {b, c} = {c}, so d(a, b) > d(a, c), contradiction. 
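
The contradiction in remark (2) can be replayed by brute force on the three points a, b, c. This is a small illustrative sketch, not part of the original argument: it assumes a symmetric, identity respecting distance with values drawn from a small set (`values` is our parameter), and searches for a distance under which the individual variant returns both {b} and {c} as the postulates demand.

```python
from itertools import product


def make_distance(dab, dac, dbc):
    """Symmetric, identity respecting distance on the points a, b, c."""
    table = {
        frozenset(("a", "b")): dab,
        frozenset(("a", "c")): dac,
        frozenset(("b", "c")): dbc,
    }

    def d(x, y):
        return 0 if x == y else table[frozenset((x, y))]

    return d


def individual(X, Y, d):
    """Elements of Y that are closest to some single element of X."""
    return {y for y in Y
            if any(all(d(x, y) <= d(x, y2) for y2 in Y) for x in X)}


def agm_compatible_distances(values=(1, 2, 3)):
    """Distances for which the individual variant obeys (X | 3) and (X | 4)
    on both test cases of remark (2); the remark predicts there are none."""
    found = []
    for dab, dac, dbc in product(values, repeat=3):
        d = make_distance(dab, dac, dbc)
        first = individual({"a", "b"}, {"b", "c"}, d) == {"b"}
        second = individual({"a", "c"}, {"b", "c"}, d) == {"c"}
        if first and second:
            found.append((dab, dac, dbc))
    return found
```

The search comes back empty for these values, in line with the remark: the first case needs d(a, b) < d(a, c), the second needs the opposite.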

Proposition 8.2.2 

Contraction, revision, and epistemic entrenchment are interdefinable by the following equations, i.e., if the defining side 
has the respective properties, so will the defined side. 

Logical side                                         Semantic side 

K * φ := Cn((K − ¬φ) ∪ {φ})                          X | A := (X ⊖ CA) ∩ A 

K − φ := K ∩ (K * ¬φ)                                X ⊖ A := X ∪ (X | CA) 

K − φ := {ψ ∈ K : (φ <_K φ ∨ ψ or ⊢ φ)}              X ⊖ A := X if A = U, 
                                                     X ⊖ A := ⋂{B : X ⊆ B ⊆ U, A <_X A ∪ B} otherwise 

φ ≤_K ψ :↔ ⊢ φ ∧ ψ or φ ∉ K − (φ ∧ ψ)                A ≤_X B :↔ A, B = U or X ⊖ (A ∩ B) ⊈ A 

Here Cn(.) denotes classical deductive closure. 



The idea of epistemic entrenchment is that φ is more entrenched than ψ (relative to K) iff M(¬ψ) is closer to M(K) than 
M(¬φ) is to M(K). In shorthand, the more we can twiggle K without reaching ¬φ, the more φ is entrenched. Truth is 
maximally entrenched: no twiggling whatever will reach falsity. The more φ is entrenched, the more we are certain about 
it. Seen this way, the properties of epistemic entrenchment relations are very natural (and trivial): As only the closest 
points of M(¬φ) count (seen from M(K)), φ or ψ will be as entrenched as φ ∧ ψ, and there is a logically strongest φ' which 
is as entrenched as φ; this is just the sphere around M(K) with radius d(M(K), M(¬φ)). 

Definition 8.2.2 

d : U × U → Z is called a pseudo-distance on U iff (d1) holds: 
(d1) Z is totally ordered by a relation <. 

If, in addition, Z has a <-smallest element 0, and (d2) holds, we say that d respects identity: 
(d2) d(a, b) = 0 iff a = b. 

If, in addition, (d3) holds, then d is called symmetric: 
(d3) d(a, b) = d(b, a). 
(For any a, b ∈ U.) 

Note that we can force the triangle inequality to hold trivially (if we can choose the values in the real numbers): It suffices 
to choose the values in the set {0} ∪ [0.5, 1], i.e. in the interval from 0.5 to 1, or as 0. 
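
The remark on the triangle inequality is easy to test numerically. The sketch below assumes, beyond the text, that the distance is symmetric and respects identity (value 0 exactly on the diagonal); off-diagonal values are drawn at random from [0.5, 1]. The helper names are ours.

```python
import random
from itertools import product


def random_distance(points, rng):
    """Random symmetric distance: 0 on the diagonal, values in [0.5, 1] elsewhere."""
    table = {}
    for i, p in enumerate(points):
        for q in points[i + 1:]:
            table[frozenset((p, q))] = rng.uniform(0.5, 1.0)

    def d(p, q):
        return 0.0 if p == q else table[frozenset((p, q))]

    return d


def satisfies_triangle(points, d):
    """Check d(p, r) <= d(p, q) + d(q, r) for all triples of points."""
    return all(d(p, r) <= d(p, q) + d(q, r)
               for p, q, r in product(points, repeat=3))
```

The reason it works: any detour through a third distinct point costs at least 0.5 + 0.5 = 1, which no direct distance exceeds.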



Definition 8.2.3 

We define the collective and the individual variant of choosing the closest elements in the second operand by two operators, 
|, ↑ : 𝒫(U) × 𝒫(U) → 𝒫(U): 

Let d be a distance or pseudo-distance. 

X | Y := {y ∈ Y : ∃x_y ∈ X. ∀x' ∈ X, ∀y' ∈ Y (d(x_y, y) ≤ d(x', y'))} 

(the collective variant, used in theory revision) 

and 

X ↑ Y := {y ∈ Y : ∃x_y ∈ X. ∀y' ∈ Y (d(x_y, y) ≤ d(x_y, y'))} 

(the individual variant, used for counterfactual conditionals and theory update). 

Thus, A |_d B is the subset of B consisting of all b ∈ B that are closest to A. Note that, if A or B is infinite, A |_d B may 
be empty, even if A and B are not empty. A condition assuring nonemptiness will be imposed when necessary. 
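
For finite sets, the two variants can be written down directly. The following sketch is only an illustration: the function names and the example distance on the integers are our own choices, and the finite setting sidesteps the emptiness problem mentioned above.

```python
def collective(X, Y, d):
    """Collective variant: elements of Y at globally minimal distance from X."""
    if not X or not Y:
        return set()
    best = min(d(x, y) for x in X for y in Y)
    return {y for y in Y if any(d(x, y) == best for x in X)}


def individual(X, Y, d):
    """Individual variant: elements of Y closest to at least one x in X."""
    return {y for y in Y
            if any(all(d(x, y) <= d(x, y2) for y2 in Y) for x in X)}


def line_distance(x, y):
    """Example distance: absolute difference on the integers."""
    return abs(x - y)
```

With X = {0, 10} and Y = {1, 12}, the collective variant keeps only 1 (the global minimum is d(0, 1) = 1), while the individual variant also keeps 12, because 12 is the closest element of Y as seen from 10.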



Definition 8.2.4 

An operation | : 𝒫(U) × 𝒫(U) → 𝒫(U) is representable iff there is a pseudo-distance d : U × U → Z such that 
A | B = A |_d B := {b ∈ B : ∃a_b ∈ A ∀a' ∈ A ∀b' ∈ B (d(a_b, b) ≤ d(a', b'))}. 

The following is the central definition, it describes the way a revision *_d is attached to a pseudo-distance d on the set of 
models. 



Definition 8.2.5 

T *_d T' := Th(M(T) |_d M(T')). 
Fact 8.2.3 

A distance based revision satisfies the AGM postulates provided: 

(1) it respects identity, i.e. d(a, a) < d(a, b) for all a ≠ b, 

(2) it satisfies a limit condition: minima exist, 

(3) it is definability preserving. 

(It is trivial to see that the first two are necessary, and Example 8.2.1 (page 161) (2) below shows the necessity of (3). In 
particular, (2) and (3) will hold for finite languages.) 



Proof 

We use | to abbreviate |_d. As a matter of fact, we show slightly more, as we admit also full theories on the right of *. 
(K * 1), (K * 2), (K * 6) hold by definition, (K * 3) and (K * 4) as d respects identity, (K * 5) by existence of minima. 
It remains to show (K * 7) and (K * 8), we do them together, and show: If T * T' is consistent with T'', then T * (T' ∪ T'') 
= Cn((T * T') ∪ T''). 

Note that M(S ∪ S') = M(S) ∩ M(S'), and that M(S * S') = M(S) | M(S'). (The latter is only true if | is definability 
preserving.) By prerequisite, M(T * T') ∩ M(T'') ≠ ∅, so (M(T) | M(T')) ∩ M(T'') ≠ ∅. Let A := M(T), B := M(T'), 
C := M(T''). "⊆": Let b ∈ A | (B ∩ C). By prerequisite, there is b' ∈ (A | B) ∩ C. Thus d(A, b') ≥ d(A, B ∩ C) = d(A, b). 
As b ∈ B, b ∈ A | B, but b ∈ C, too. "⊇": Let b' ∈ (A | B) ∩ C. Thus d(A, b') = d(A, B) ≤ d(A, B ∩ C), so by b' ∈ B ∩ C, 
b' ∈ A | (B ∩ C). We conclude M(T) | (M(T') ∩ M(T'')) = (M(T) | M(T')) ∩ M(T''), thus that T * (T' ∪ T'') = Cn((T * T') ∪ T''). 
□ 
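
The set-level statement at the heart of this proof, that (A | B) ∩ C ≠ ∅ implies A | (B ∩ C) = (A | B) ∩ C, can be spot-checked on random finite sets. The sketch below is a sanity check under our own assumptions (integer points, absolute difference as distance, our function names), not a substitute for the proof.

```python
import random


def collective(X, Y, d):
    """Elements of Y at globally minimal distance from X (finite sets)."""
    if not X or not Y:
        return set()
    best = min(d(x, y) for x in X for y in Y)
    return {y for y in Y if any(d(x, y) == best for x in X)}


def check_ranked_condition(trials=500, seed=0):
    """Test (A | B) & C nonempty  =>  A | (B & C) == (A | B) & C."""
    rng = random.Random(seed)
    universe = list(range(12))

    def d(x, y):
        return abs(x - y)

    for _ in range(trials):
        A = set(rng.sample(universe, rng.randint(1, 4)))
        B = set(rng.sample(universe, rng.randint(1, 6)))
        C = set(rng.sample(universe, rng.randint(1, 6)))
        left = collective(A, B, d) & C
        if left and collective(A, B & C, d) != left:
            return False
    return True
```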



Definition 8.2.6 

For X, Y ≠ ∅, set U_Y(X) := {z : d(X, z) ≤ d(X, Y)}. 

Fact 8.2.4 

Let X, Y, Z ≠ ∅. Then 

(1) U_Y(X) ∩ Z ≠ ∅ iff (X | (Y ∪ Z)) ∩ Z ≠ ∅, 

(2) U_Y(X) ∩ Z ≠ ∅ iff CZ ≤_X CY, where ≤_X is epistemic entrenchment relative to X. 

Proof 

(1) Trivial. 

(2) CZ ≤_X CY iff X ⊖ (CZ ∩ CY) ⊈ CZ. X ⊖ (CZ ∩ CY) = X ∪ (X | C(CZ ∩ CY)) = X ∪ (X | (Z ∪ Y)). So 
X ⊖ (CZ ∩ CY) ⊈ CZ ⇔ (X ∪ (X | (Z ∪ Y))) ∩ Z ≠ ∅ ⇔ X ∩ Z ≠ ∅ or (X | (Z ∪ Y)) ∩ Z ≠ ∅ ⇔ d(X, Z) ≤ d(X, Y). 

□ 
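
Part (1) of Fact 8.2.4 can be illustrated on finite sets. The sketch assumes that the comparison in Definition 8.2.6 is read as non-strict (≤), which is what the proof of part (2) uses; the universe, the distance and all names are our own choices.

```python
import random


def set_distance(X, Y, d):
    """d(X, Y): minimal distance between an element of X and one of Y."""
    return min(d(x, y) for x in X for y in Y)


def collective(X, Y, d):
    """Elements of Y at globally minimal distance from X."""
    best = set_distance(X, Y, d)
    return {y for y in Y if any(d(x, y) == best for x in X)}


def inner_region(X, Y, universe, d):
    """U_Y(X): points at most as far from X as Y is (non-strict reading)."""
    bound = set_distance(X, Y, d)
    return {z for z in universe if set_distance(X, {z}, d) <= bound}


def check_fact(trials=500, seed=1):
    """Test U_Y(X) & Z nonempty  <=>  (X | (Y u Z)) & Z nonempty."""
    rng = random.Random(seed)
    universe = list(range(12))

    def d(x, y):
        return abs(x - y)

    for _ in range(trials):
        X = set(rng.sample(universe, rng.randint(1, 3)))
        Y = set(rng.sample(universe, rng.randint(1, 4)))
        Z = set(rng.sample(universe, rng.randint(1, 4)))
        lhs = bool(inner_region(X, Y, universe, d) & Z)
        rhs = bool(collective(X, Y | Z, d) & Z)
        if lhs != rhs:
            return False
    return True
```

Both sides reduce to d(X, Z) ≤ d(X, Y), which is why the check succeeds.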



Definition 8.2.7 

Let U ≠ ∅, 𝒴 ⊆ 𝒫(U) satisfy (∩), (∪), ∅ ∉ 𝒴. 
Let | : 𝒴 × 𝒴 → 𝒫(U). 

Let * be a revision function defined for arbitrary consistent theories on both sides. (This is thus a slight extension of the 
AGM framework, as AGM work with formulas only on the right of *.) 

Algebraic side                                       Logical side 

                                                     (*Equiv) 
                                                     ⊨ T ↔ S, ⊨ T' ↔ S' ⇒ T * T' = S * S', 

                                                     (*CCL) 
                                                     T * T' is a consistent, deductively closed theory, 

(| Succ)                                             (*Succ) 
A | B ⊆ B                                            T' ⊆ T * T', 

(| Con)                                              (*Con) 
A ∩ B ≠ ∅ ⇒ A | B = A ∩ B                            Con(T ∪ T') ⇒ T * T' = Cn(T ∪ T'), 

Loop conditions. Intuitively, using symmetry: 
d(X_0, X_1) ≤ d(X_1, X_2), 
d(X_1, X_2) ≤ d(X_2, X_3), 
d(X_2, X_3) ≤ d(X_3, X_4), 
... 
d(X_{k-1}, X_k) ≤ d(X_0, X_k) 
⇒ 
d(X_0, X_1) ≤ d(X_0, X_k). 

(| Loop) 
(X_1 | (X_0 ∪ X_2)) ∩ X_0 ≠ ∅, 
(X_2 | (X_1 ∪ X_3)) ∩ X_1 ≠ ∅, 
(X_3 | (X_2 ∪ X_4)) ∩ X_2 ≠ ∅, 
... 
(X_k | (X_{k-1} ∪ X_0)) ∩ X_{k-1} ≠ ∅ 
⇒ 
(X_0 | (X_k ∪ X_1)) ∩ X_1 ≠ ∅ 

(*Loop) 
Con(T_0, T_1 * (T_0 ∨ T_2)), 
Con(T_1, T_2 * (T_1 ∨ T_3)), 
Con(T_2, T_3 * (T_2 ∨ T_4)), 
... 
Con(T_{k-1}, T_k * (T_{k-1} ∨ T_0)) 
⇒ 
Con(T_1, T_0 * (T_k ∨ T_1)) 
Proposition 8.2.5 

The following connections between the logical and the algebraic side might be the most interesting ones. We will consider 
in all cases also the variant with full theories. 

Given * which respects logical equivalence, let M(T) | M(T') := M(T * T'), conversely, given |, let T * T' := Th(M(T) | 
M(T')). We then have: 

(1.1)  (K * 7)   ⇒   (X | 7) 
(1.2)            ⇐   (μdp) 
(1.3)            ⇐   B is the model set for some φ 
(1.4)            ⇍   in general 

(2.1)  (*Loop)   ⇒   (| Loop) 
(2.2)            ⇐   (μdp) 
(2.3)            ⇐   all X_i are the model sets for some φ_i 
(2.4)            ⇍   in general 


Proof 

(1) 

We consider the equivalence of T * (T' ∪ T'') ⊆ Cn((T * T') ∪ T'') and (M(T) | M(T')) ∩ M(T'') ⊆ M(T) | (M(T') ∩ M(T'')). 

(1.1) 

(M(T) | M(T')) ∩ M(T'') = M(T * T') ∩ M(T'') = M((T * T') ∪ T'') ⊆ (by (K * 7)) M(T * (T' ∪ T'')) = M(T) | M(T' ∪ T'') = 
M(T) | (M(T') ∩ M(T'')). 

(1.2) 

T * (T' ∪ T'') = Th(M(T) | M(T' ∪ T'')) = Th(M(T) | (M(T') ∩ M(T''))) ⊆ (by (X | 7)) Th((M(T) | M(T')) ∩ M(T'')) = (by (μdp)) 
Cn(Th(M(T) | M(T')) ∪ T'') = Cn(Th(M(T * T')) ∪ T'') = Cn((T * T') ∪ T''). 

(1.3) 

Let T'' be equivalent to φ''. We can then replace the use of (μdp) in the proof of (1.2) by Fact 2.2.3 (page 29) (3). 

(1.4) 

By Example 8.2.1 (page 161) (2), (K * 7) may fail, though (X | 7) holds. 

(2.1) and (2.2): 

Con(T_0, T_1 * (T_0 ∨ T_2)) ⇔ M(T_0) ∩ M(T_1 * (T_0 ∨ T_2)) ≠ ∅. 

M(T_1 * (T_0 ∨ T_2)) = M(Th(M(T_1) | M(T_0 ∨ T_2))) = M(Th(M(T_1) | (M(T_0) ∪ M(T_2)))) = (by (μdp)) M(T_1) | (M(T_0) ∪ M(T_2)), 
so Con(T_0, T_1 * (T_0 ∨ T_2)) ⇔ M(T_0) ∩ (M(T_1) | (M(T_0) ∪ M(T_2))) ≠ ∅. 

Thus, all conditions translate one-to-one, and we use (| Loop) and (*Loop) to go back and forth. 

(2.3): 

Let A := M(Th(M(T_1) | (M(T_0) ∪ M(T_2)))), A' := M(T_1) | (M(T_0) ∪ M(T_2)), then we do not need A = A', it suffices to 
have M(T_0) ∩ A ≠ ∅ ⇔ M(T_0) ∩ A' ≠ ∅. A is the smallest definable set containing A', so we can use Fact 2.2.3 (page 29) (4), 
if T_0 is equivalent to some φ_0. This has to hold for all T_i, so all T_i have to be equivalent to some φ_i. 

(2.4): 

By Proposition 8.2.6 (page 162), all distance defined | satisfy (| Loop). By Example 8.2.1 (page 161) (1), (*Loop) may fail. 
□ 



The following table summarizes representation of theory revision functions by structures with a distance. 
By "pseudo-distance" we mean here a pseudo-distance which respects identity, and is symmetrical. 
(| ∅) means that if X, Y ≠ ∅, then X |_d Y ≠ ∅. 

| -function                     Distance structure              * -function 

(| Succ) + (| Con) +     ⇔, (∪) + (∩)     pseudo-distance     ⇔, (μdp) + (| ∅)     (*Equiv) + (*CCL) + (*Succ) + 
(| Loop)                 Proposition 8.2.6                    Proposition 8.2.7    (*Con) + (*Loop) 
                         (page 162)                           (page 162) 

any finite               ⇎                                    ⇏ without (μdp) 
characterization         Proposition 8.2.8                    Example 8.2.1 
                                                              (page 161) 

The following Example 8.2.1 (page 161) shows that, in general, a revision operation defined on models via a pseudo-distance 
by T * T' := Th(M(T) |_d M(T')) might not satisfy (*Loop) or (K * 7), unless we require |_d to preserve definability. 

Example 8.2.1 

Consider an infinite propositional language ℒ. 

Let X be an infinite set of models, m, m_1, m_2 be models for ℒ. Arrange the models of ℒ in the real plane s.t. all x ∈ X 
have the same distance < 2 (in the real plane) from m, m_2 has distance 2 from m, and m_1 has distance 3 from m. 

(1) M(T') = X ∪ {m_1}. T, T', T_2 will be pairwise inconsistent. 

(2) M(T') = X ∪ {m_1, m_2}, M(T'') = {m_1, m_2}. 

Assume in both cases Th(X) = T', so X will not be definable by a theory. 
Now for the results: 

Then M(T) | M(T') = X, but T * T' = Th(X) = T'. 

(1) We easily verify Con(T, T_2 * (T ∨ T)), Con(T_2, T * (T_2 ∨ T_1)), Con(T, T_1 * (T ∨ T)), Con(T_1, T * (T_1 ∨ T')), 
Con(T, T' * (T ∨ T)), and conclude by Loop (i.e. (*Loop)) Con(T_2, T * (T' ∨ T_2)), which is wrong. 

(2) So T * T' is consistent with T'', and Cn((T * T') ∪ T'') = T''. But T' ∪ T'' = T'', and T * (T' ∪ T'') = T_2 ≠ T'', contradicting 
(K * 7). 

□ 



Proposition 8.2.6 

Let U ≠ ∅, 𝒴 ⊆ 𝒫(U) be closed under finite ∩ and finite ∪, ∅ ∉ 𝒴. 

(a) | is representable by a symmetric pseudo-distance d : U × U → Z iff | satisfies (| Succ) and (| Loop) in Definition 8.2.7 
(page 160). 

(b) | is representable by an identity respecting symmetric pseudo-distance d : U × U → Z iff | satisfies (| Succ), (| Con), 
and (| Loop) in Definition 8.2.7 (page 160). 

See [LMS01] or [Sch04]. 

Proposition 8.2.7 

Let ℒ be a propositional language. 

(a) A revision operation * is representable by a symmetric consistency and definability preserving pseudo-distance iff * 
satisfies (*Equiv), (*CCL), (*Succ), (*Loop). 

(b) A revision operation * is representable by a symmetric consistency and definability preserving, identity respecting 
pseudo-distance iff * satisfies (*Equiv), (*CCL), (*Succ), (*Con), (*Loop). 

See [LMS01] or [Sch04]. 

Example 8.2.2 

This example shows the expressive weakness of revision based on distance: not all distance relations can be reconstructed 
from the revision operator. Thus, a revision operator does not allow us to "observe" all distance relations, so transitivity 
of ≤ cannot necessarily be captured in a short condition, but requires arbitrarily long conditions, see Proposition 8.2.8. 

Note that even when the pseudo-distance is a real distance, the resulting revision operator |_d does not always permit us to 
reconstruct the relations of the distances: revision is a coarse instrument to investigate distances. 

Distances with common start (or end, by symmetry) can always be compared by looking at the result of revision: 
a |_d {b, b'} = b iff d(a, b) < d(a, b'), 
a |_d {b, b'} = b' iff d(a, b) > d(a, b'), 
a |_d {b, b'} = {b, b'} iff d(a, b) = d(a, b'). 
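
The three cases above amount to a small decision procedure: revise the singleton {a} by {b, b'} and read off the comparison. A minimal sketch, assuming points in the real plane with the Euclidean distance (`math.dist`) and our own function names:

```python
import math


def collective(X, Y, d):
    """Elements of Y at globally minimal distance from X."""
    best = min(d(x, y) for x in X for y in Y)
    return {y for y in Y if any(d(x, y) == best for x in X)}


def compare_from(a, b, b2, d=math.dist):
    """Compare d(a, b) with d(a, b2) using only the result of a | {b, b2}."""
    result = collective({a}, {b, b2}, d)
    if result == {b}:
        return "<"
    if result == {b2}:
        return ">"
    return "="
```

For example, from the origin the point (1, 0) is closer than (0, 2), so the procedure returns "<". What the rest of the example shows is that no such procedure exists for distances without a common endpoint.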

This is not the case with arbitrary distances d(x, y) and d(a, b), as this example will show. 

We work in the real plane, with the standard distance, the angles have 120 degrees, a' is closer to y than x is to y, a is 
closer to b than x is to y, but a' is farther away from b' than x is from y. Similarly for b, b'. But we cannot distinguish the 
situation {a, b, x, y} and the situation {a', b', x, y} through |_d. (See Diagram 8.2.1 (page 163)): 

Seen from a, the distances are in that order: y, b, x. 

Seen from a', the distances are in that order: y, b', x. 

Seen from b, the distances are in that order: y, a, x. 

Seen from b', the distances are in that order: y, a', x. 

Seen from y, the distances are in that order: a/b, x. 

Seen from y, the distances are in that order: a'/b', x. 

Seen from x, the distances are in that order: y, a/b. 

Seen from x, the distances are in that order: y, a'/b'. 

Thus, any c |_d C will be the same in both situations (with a interchanged with a', b with b'). The same holds for any 
X |_d C where X has two elements. 

Thus, any C |_d D will be the same in both situations, when we interchange a with a', and b with b'. So we cannot determine 
by |_d whether d(x, y) > d(a, b) or not. □ 
Proposition 8.2.8 

There is no finite characterization of distance based | -operators. 

(Attention: this is, of course, false when we fix the left hand side: the AGM axioms give a finite characterization. So this 
also shows the strength of being able to change the left hand side.) 

See [Sch04]. 

8.2.2 Booth revision 
8.2.2.1 Introduction 

This material is due to Booth and co-authors. 

8.2.2.1.1 The problem we solve Booth and his co-authors have shown in very interesting papers, see [BN06] and 
[BCMG06], that many new approaches to theory revision (with fixed K) can be represented by two relations, < and ◁, 
where < is the usual ranked relation, and ◁ is a sub-relation of <. They have, however, left open a characterization of the 
infinite case, which we treat here. 

The, for us, main definition they give is (in slight modification, we use the strict subrelations): 

Definition 8.2.8 

Given K, and < and ◁, we define 

K ⊖ φ := Th({w : w ◁ w' for some w' ∈ min(M(¬φ), <)}), 

i.e. K ⊖ φ is given by all those worlds, which are below the closest ¬φ-worlds, as seen from K. 

We want to characterize K ⊖ φ, for fixed K. Booth et al. have done the finite case by working with complete consistent 
formulas, i.e. single models. We want to do the infinite case without using complete consistent theories, i.e. in the usual 
style of completeness results in the area. 

Our approach is basically semantic, though we sometimes use the language of logic, on the one hand to show how to 
approximate a single model with formulas, and on the other hand when we use classical compactness. This is, however, 
just a matter of speaking, and we could translate it into model sets, too, but we do not think that we would win much by 
doing so. Moreover, we will treat only the formula case, as this seems to be the most interesting (otherwise the problem 
of approximation by formulas would not exist), and restrict ourselves to the definability preserving case. The more general 
case is left open, for a young researcher who wants to sharpen his tools by solving it. Another open problem is to treat 
the same question for variable K, for distance based revision. 

8.2.2.1.2 The framework For the reader's convenience, and to put our work a bit more into perspective, we repeat 
now some of the definitions and results given by Booth and his co-authors. 

Consequently, all material in this section is due to Booth and his co-authors. 

≤ will be a total pre-order, anchored on M(K), the models of K, i.e. M(K) = min(W, ≤), the set of ≤-minimal worlds. 

Definition 8.2.9 

(1) (≤, ⪯) is a K-context iff ≤ is a total pre-order on W, anchored on M(K), and ⪯ is a reflexive sub-relation of ≤. 

(2) K ⊖ φ := Th({w : w ⪯ w' for some w' ∈ min(M(¬φ), ≤)}) is called a basic removal operator. 
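
On a finite set of worlds, the model set behind a basic removal operator can be computed directly from the two relations. The following is a sketch under our own encoding, not the construction of Booth et al.: the total pre-order ≤ is given by a rank function, ⪯ by an explicit set of pairs, and a formula by its set of models. It returns the model set, not the theory.

```python
def is_context(worlds, rank, preceq):
    """Check that preceq is a reflexive sub-relation of the pre-order given by rank."""
    reflexive = all((w, w) in preceq for w in worlds)
    sub_relation = all(rank[v] <= rank[w] for (v, w) in preceq)
    return reflexive and sub_relation


def removal_models(worlds, rank, preceq, phi_worlds):
    """Worlds v with v preceq w for some rank-minimal model w of not-phi."""
    counter_models = worlds - phi_worlds          # M(not phi)
    if not counter_models:                        # phi holds everywhere
        return set()
    best = min(rank[w] for w in counter_models)
    closest = {w for w in counter_models if rank[w] == best}
    return {v for v in worlds if any((v, w) in preceq for w in closest)}


# Hypothetical example: worlds 0 and 1 are the models of K (rank 0).
WORLDS = {0, 1, 2, 3}
RANK = {0: 0, 1: 0, 2: 1, 3: 2}
PRECEQ = {(w, w) for w in WORLDS} | {(0, 2)}
```

With φ true exactly in {0, 1, 3}, the closest ¬φ-world is 2, and the result is {0, 2}: world 2 itself (by reflexivity) and world 0 (because 0 ⪯ 2). Since a ¬φ-world is always among the result, φ is never in K ⊖ φ in this encoding.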

Theorem 8.2.9 

Basic removal is characterized by: 

(B1) K ⊖ φ = Cn(K ⊖ φ), Cn classical consequence, 

(B2) φ ∉ K ⊖ φ, 

(B3) If ⊨ φ ↔ φ', then K ⊖ φ = K ⊖ φ', 

(B4) K ⊖ ⊥ = K, 

(B5) K ⊖ φ ⊆ Cn(K ∪ {¬φ}), 

(B6) if σ ∈ K ⊖ (σ ∧ φ), then σ ∈ K ⊖ (σ ∧ φ ∧ ψ), 

(B7) if σ ∈ K ⊖ (σ ∧ φ), then K ⊖ φ ⊆ K ⊖ (σ ∧ φ), 

(B8) (K ⊖ σ) ∩ (K ⊖ φ) ⊆ K ⊖ (σ ∧ φ), 

(B9) if φ ∉ K ⊖ (σ ∧ φ), then K ⊖ (σ ∧ φ) ⊆ K ⊖ φ. 

(B1) - (B3) belong to the basic AGM contraction postulates, (B4) - (B5) are weakened versions of another basic AGM 
postulate: 

(Vacuity) If φ ∉ K, then K ⊖ φ = K 

which does not necessarily hold for basic removal operators. 

The same holds for the remaining two basic AGM contraction postulates: 

(Inclusion) K ⊖ φ ⊆ K 

(Recovery) K ⊆ Cn((K ⊖ φ) ∪ {φ}). 

The main definition towards the completeness result of Booth et al. is: 

Definition 8.2.10 

Given K and ⊖, the structure C(K, ⊖) is defined by: 
(≤) w ≤ w' iff ¬α ∉ K ⊖ (¬α ∧ ¬α') and 
(⪯) w ⪯ w' iff ¬α ∉ K ⊖ ¬α', 

where α is a formula which holds exactly in w, analogously for w' and α'. 

Booth et al. then give a long list of theorems showing equivalence between various postulates, and conditions on the 
orderings ≤ and ⪯. This, of course, shows the power of their approach. 

We give three examples: 

Condition 8.2.1 

(c) If (for each i = 1, 2) w_i ≤ w' for all w', then w_1 ⪯ w_2. 

(d) If w_1 ≤ w_2 for all w_2, then w_1 ⪯ w_2 for all w_2. 

(e) If w_1 ⪯ w_2, then w_1 = w_2 or w_1 ≤ w' for all w'. 

Theorem 8.2.10 

Let ⊖ be a basic removal operator as defined above. 

(1) ⊖ satisfies one half of (Vacuity): If φ ∉ K, then K ⊆ K ⊖ φ, 

(2.1) If (≤, ⪯) satisfies (c), then ⊖ satisfies (Vacuity). 

(2.2) If ⊖ satisfies (Vacuity), then C(K, ⊖) satisfies (c). 

(3.1) If (≤, ⪯) satisfies (d), then ⊖ satisfies (Inclusion). 

(3.2) If ⊖ satisfies (Inclusion), then C(K, ⊖) satisfies (d). 

(4.1) If (≤, ⪯) satisfies (e), then ⊖ satisfies (Recovery). 

(4.2) If ⊖ satisfies (Recovery), then C(K, ⊖) satisfies (e). 

(5) The following are equivalent: 

(5.1) ⊖ is a full AGM contraction operator, 

(5.2) ⊖ satisfies (B1) - (B9), (Inclusion), and (Recovery) 
We change perspective a little, and work directly with a ranked relation, so we forget about the (fixed) K of revision, and 
have an equivalent, ranked structure. We are then interested in an operator ν, which returns a model set ν(φ) := ν(M(φ)), 
where ν(φ) ∩ M(φ) is given by a ranked relation <, and ν(φ) − M(φ) := {x ∉ M(φ) : ∃y ∈ ν(φ) ∩ M(φ) (x ◁ y)}, and ◁ 
is an arbitrary subrelation of <. The essential problem is to find such y, as we have only formulas to find it. (If we had 
full theories, we could just look at all Th({y}) whether x ∈ ν(Th({y})).) There is still some more work to do, as we have 
to connect the two relations, and simply taking a ready representation result will not do, as we shall see. 

We first introduce some notation, then a set of conditions, and formulate the representation result. Soundness will be 
trivial. For completeness, we construct first the ranked relation <, show that it does what it should do, and then the 
subrelation ◁. 

Notation 8.2.1 

We set 

$\mu^+(X) := \nu(X) \cap X$ 
$\mu^-(X) := \nu(X) - X$ 
where $X := M(\phi)$ for some $\phi$. 



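Condition 8.2.2 

$(\mu^{-}1)$ $Y \cap \mu^-(X) \neq \emptyset \rightarrow \mu^+(Y) \cap X = \emptyset$ 

$(\mu^{-}2)$ $Y \cap \mu^-(X) \neq \emptyset \rightarrow \mu^+(X \cup Y) = \mu^+(Y)$ 

$(\mu^{-}3)$ $Y \cap \mu^-(X) \neq \emptyset \rightarrow \mu^-(Y) \cap X = \emptyset$ 

$(\mu^{-}4)$ $\mu^+(A) \subseteq \mu^+(B) \rightarrow \mu^-(A) \subseteq \mu^-(B)$ 

$(\mu^{-}5)$ $\mu^+(X \cup Y) = \mu^+(X) \cup \mu^+(Y) \rightarrow \mu^-(X \cup Y) = \mu^-(X) \cup \mu^-(Y)$ 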

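Fact 8.2.11 

$(\mu^{-}1)$, and $(\mu\emptyset)$, $(\mu\subseteq)$ for $\mu^+$, imply 

(1) $\mu^+(X) \cap Y \neq \emptyset \rightarrow \mu^+(X) \cap \mu^-(Y) = \emptyset$ 

(2) $X \cap \mu^-(X) = \emptyset$. 



Proof 

(1) Let $\mu^+(X) \cap \mu^-(Y) \neq \emptyset$, then $X \cap \mu^-(Y) \neq \emptyset$, so by $(\mu^{-}1)$ $\mu^+(X) \cap Y = \emptyset$. 

(2) Set X := Y, and use $(\mu\emptyset)$, $(\mu\subseteq)$, $(\mu^{-}1)$, (1). 
□ 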



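Proposition 8.2.12 

$\nu : \{M(\phi) : \phi \in F(\mathcal{L})\} \rightarrow \mathcal{D}_{\mathcal{L}}$ is representable by $\prec$ and $\triangleleft$, where $\prec$ is a smooth ranked relation, and $\triangleleft$ a subrelation of 
$\prec$, and $\mu^+(X)$ is the usual set of $\prec$-minimal elements of X, and $\mu^-(X) = \{x \notin X : \exists y \in \mu^+(X).(x \triangleleft y)\}$, iff the following 
conditions hold: $(\mu\subseteq)$, $(\mu\emptyset)$, $(\mu=)$ for $\mu^+$, and $(\mu^{-}1)$ to $(\mu^{-}5)$ for $\mu^+$ and $\mu^-$. 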

Proof 

8.2.2.2.1 Soundness 

The first three hold for smooth ranked structures, and the others are easily verified. 
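
The "easily verified" part can also be checked mechanically on a small finite structure. The following Python sketch is only an illustration: the universe, the rank function `rank` (standing in for the ranked relation), and the subrelation `sub` are hypothetical examples, and every nonempty subset is treated as a definable model set, an assumption that is only reasonable in the finite case.

```python
from itertools import combinations


def nonempty_subsets(universe):
    """Yield every nonempty subset of the universe as a frozenset."""
    items = sorted(universe)
    for size in range(1, len(items) + 1):
        for combo in combinations(items, size):
            yield frozenset(combo)


def mu_plus(X, rank):
    """Minimal elements of X in the ranked order induced by `rank`."""
    lowest = min(rank[x] for x in X)
    return frozenset(x for x in X if rank[x] == lowest)


def mu_minus(X, rank, sub, universe):
    """Elements outside X that are sub-related to some minimal element of X."""
    best = mu_plus(X, rank)
    return frozenset(
        x for x in universe
        if x not in X and any((x, y) in sub for y in best)
    )


def conditions_hold(universe, rank, sub):
    """Brute-force check of the five mu-minus conditions on all pairs of subsets."""
    # `sub` has to be a subrelation of the strict ranked order.
    assert all(rank[a] < rank[b] for a, b in sub)
    subsets = list(nonempty_subsets(universe))
    for X in subsets:
        for Y in subsets:
            pX, pY, pXY = mu_plus(X, rank), mu_plus(Y, rank), mu_plus(X | Y, rank)
            mX = mu_minus(X, rank, sub, universe)
            mY = mu_minus(Y, rank, sub, universe)
            mXY = mu_minus(X | Y, rank, sub, universe)
            if Y & mX:
                if pY & X:          # condition 1
                    return False
                if pXY != pY:       # condition 2
                    return False
                if mY & X:          # condition 3
                    return False
            if pX <= pY and not mX <= mY:        # condition 4
                return False
            if pXY == pX | pY and mXY != mX | mY:  # condition 5
                return False
    return True


# Hypothetical example structure: a is lowest, b and c share a level, d is highest.
UNIVERSE = {"a", "b", "c", "d"}
RANK = {"a": 0, "b": 1, "c": 1, "d": 2}
SUB = {("a", "c"), ("b", "d")}
```

Calling `conditions_hold(UNIVERSE, RANK, SUB)` runs the check for this one structure; it is a sanity test of the soundness claim, not a proof.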



8.2.2.2.2 Completeness 

We first show how to generate the ranked relation $\prec$: 
There is a small problem. 

The authors first thought that one may take any result for ranked structures off the shelf, plug in the other relation 
somehow (see the second half), and that's it. No, that isn't it: Suppose there is x, and a sequence $x_i$ converging to x in 
the usual topology. Thus, if $x \in M(\phi)$, then there will always be some $x_i$ in $M(\phi)$, too. Take now a ranked structure Z, 
where all the $x_i$ are strictly smaller than x. Consider $\mu(\phi)$, this will usually not contain x (avoid some nasty things with 
definability), so in the usual construction ($\preceq_1$ below), x will not be forced to be below any element y, however high up $y \succ x$ 
might be. However, there is $\psi$ separating x and y, e.g. $x \models \neg\psi$, $y \models \psi$, and if we take as the second relation just the 
ranking again, $x \in \mu^-(\psi)$, so this becomes visible. 




We follow closely the strategy of the proof of 3.10.11 in [Sch04]. We will, however, change notation at one point: the 
relation R in [Sch04] is called $\preceq$ here. The proof goes over several steps, which we will enumerate. 

Note that by Fact 2.3.1 (page 32), taken from [Sch04], see also [GS08c], $(\mu\|)$, $(\mu\cup)$, $(\mu\cup')$, $(\mu=')$ hold, as the prerequisites 
about the domain are valid. 

(1) To generate the ranked relation $\prec$, we define two relations, $\preceq_1$ and $\preceq_2$, where $\preceq_1$ is the usual one for ranked structures, 
as defined in the proof of 3.10.11 of [Sch04]: $a \preceq_1 b$ iff $a \in \mu^+(X)$, $b \in X$, or a = b, and $a \preceq_2 b$ iff $a \in \mu^-(X)$, $b \in X$. 

Moreover, we set $a \preceq b$ iff $a \preceq_1 b$ or $a \preceq_2 b$. 

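(2) Obviously, $\preceq$ is reflexive, we show that $\preceq$ is transitive by looking at the four different cases. 

(2.1) In [Sch04], it was shown that $a \preceq_1 b \preceq_1 c \rightarrow a \preceq_1 c$. For completeness' sake, we repeat the argument: Suppose $a \preceq_1 b$, 
$b \preceq_1 c$, let $a \in \mu^+(A)$, $b \in A$, $b \in \mu^+(B)$, $c \in B$. We show $a \in \mu^+(A \cup B)$. By $(\mu\|)$ $a \in \mu^+(A \cup B)$ or $b \in \mu^+(A \cup B)$. 
Suppose $b \in \mu^+(A \cup B)$, then $\mu^+(A \cup B) \cap A \neq \emptyset$, so by $(\mu=)$ $\mu^+(A \cup B) \cap A = \mu^+(A)$, so $a \in \mu^+(A \cup B)$. 

(2.2) Suppose $a \preceq_1 b \preceq_2 c$, we show $a \preceq_1 c$: Let $c \in Y$, $b \in \mu^-(Y) \cap X$, $a \in \mu^+(X)$. Consider $X \cup Y$. As $X \cap \mu^-(Y) \neq \emptyset$, 
by $(\mu^{-}2)$ $\mu^+(X \cup Y) = \mu^+(X)$, so $a \in \mu^+(X \cup Y)$ and $c \in X \cup Y$, so $a \preceq_1 c$. 

(2.3) Suppose $a \preceq_2 b \preceq_2 c$, we show $a \preceq_2 c$: Let $c \in Y$, $b \in \mu^-(Y) \cap X$, $a \in \mu^-(X)$. Consider $X \cup Y$. As $X \cap \mu^-(Y) \neq \emptyset$, 
by $(\mu^{-}2)$ $\mu^+(X \cup Y) = \mu^+(X)$, so by $(\mu^{-}5)$ $\mu^-(X \cup Y) = \mu^-(X)$, so $a \in \mu^-(X \cup Y)$ and $c \in X \cup Y$, so $a \preceq_2 c$. 

(2.4) Suppose $a \preceq_2 b \preceq_1 c$, we show $a \preceq_2 c$: Let $c \in Y$, $b \in \mu^+(Y) \cap X$, $a \in \mu^-(X)$. Consider $X \cup Y$. As $\mu^+(Y) \cap X \neq \emptyset$, 
$\mu^+(X) \subseteq \mu^+(X \cup Y)$. (Here is the argument: By $(\mu\|)$, $\mu^+(X \cup Y) = \mu^+(X) \,\|\, \mu^+(Y)$, so, if $\mu^+(X) \not\subseteq \mu^+(X \cup Y)$, 
then $\mu^+(X) \cap \mu^+(X \cup Y) = \emptyset$, so $\mu^+(X) \cap (X \cup Y - \mu^+(X \cup Y)) \neq \emptyset$ by $(\mu\emptyset)$, so by $(\mu\cup)$ $\mu^+(X \cup Y) = \mu^+(Y)$. But if 
$\mu^+(Y) \cap X = \mu^+(X \cup Y) \cap X \neq \emptyset$, $\mu^+(X) = \mu^+(X \cup Y) \cap X$ by $(\mu=)$, so $\mu^+(X) \cap \mu^+(X \cup Y) \neq \emptyset$, contradiction.) So 
$\mu^-(X) \subseteq \mu^-(X \cup Y)$ by $(\mu^{-}4)$, so $c \in X \cup Y$, $a \in \mu^-(X \cup Y)$, and $a \preceq_2 c$. 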

(3) We also see: 

(3.1) $a \in \mu^+(A)$, $b \in A - \mu^+(A)$ $\rightarrow$ $b \not\preceq a$. 

(3.2) $a \in \mu^-(A)$, $b \in A$ $\rightarrow$ $b \not\preceq a$. 

Proof of (3.1): 

(a) $\neg(b \preceq_1 a)$ was shown in [Sch04], we repeat again the argument: Suppose there is B s.t. $b \in \mu^+(B)$, $a \in B$. Then by 
$(\mu\cup)$ $\mu^+(A \cup B) \cap B = \emptyset$, and by $(\mu\cup')$ $\mu^+(A \cup B) = \mu^+(A)$, but $a \in \mu^+(A) \cap B$, contradiction. 

(b) Suppose there is B s.t. $a \in B$, $b \in \mu^-(B)$. But $A \cap \mu^-(B) \neq \emptyset$ implies $\mu^+(A) \cap B = \emptyset$ by $(\mu^{-}1)$. 

Proof of (3.2): 

(a) Suppose $b \preceq_1 a$, so there is B s.t. $a \in B$, $b \in \mu^+(B)$, so $B \cap \mu^-(A) \neq \emptyset$, so $\mu^+(B) \cap A = \emptyset$ by $(\mu^{-}1)$. 

(b) Suppose $b \preceq_2 a$, so there is B s.t. $a \in B$, $b \in \mu^-(B)$, so $B \cap \mu^-(A) \neq \emptyset$, so $\mu^-(B) \cap A = \emptyset$ by $(\mu^{-}3)$. 

(4) Let, by Fact 4.2.3 (page 152), S be a total, transitive, reflexive relation on U which extends $\preceq$ s.t. $xSy, ySx \rightarrow x \preceq y$ 
(recall that $\preceq$ is transitive and reflexive). But note that we lose ignorance here. Define $a \prec b$ iff aSb, but not bSa. If 
$a \perp b$ (i.e. neither $a \prec b$ nor $b \prec a$), then, by totality of S, aSb and bSa. $\prec$ is ranked: If $c \prec a \perp b$, then by transitivity of S 
cSb, but if bSc, then again by transitivity of S aSc, contradicting $c \prec a$. Similarly for $c \succ a \perp b$. 

(5) It remains to show that $\prec$ represents $\mu^+$ and is $\mathcal{Y}$-smooth: 

Let $a \in A - \mu^+(A)$. By $(\mu\emptyset)$, $\exists b \in \mu^+(A)$, so $b \preceq_1 a$, but by case (3.1) above $a \not\preceq b$, so bSa, but not aSb, so $b \prec a$, so 
$a \in A - \mu_\prec(A)$. Let $a \in \mu^+(A)$, then for all $a' \in A$ $a \preceq a'$, so aSa', so there is no $a' \in A$ with $a' \prec a$, so $a \in \mu_\prec(A)$. Finally, 
$\mu^+(A) \neq \emptyset$, all $x \in \mu^+(A)$ are minimal in A as we just saw, and for $a \in A - \mu^+(A)$ there is $b \in \mu^+(A)$, $b \preceq_1 a$, so the 
structure is smooth. 

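The subrelation $\triangleleft$: 

Let $x \in \mu^-(X)$, we look for $y \in \mu^+(X)$ s.t. $x \triangleleft y$, where $\triangleleft$ is the smaller, additional relation. By the definition of the 
relation $\preceq_2$ above, we know that $\triangleleft \subseteq \preceq$, and by (3.2) above $\triangleleft \subseteq \prec$. 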

Take an arbitrary enumeration of the propositional variables of $\mathcal{L}$, $p_i : i < \kappa$. We will inductively decide for $p_i$ or $\neg p_i$. $\sigma$ 
etc. will denote a finite subsequence of the choices made so far, i.e. $\sigma = \pm p_{i_0}, \ldots, \pm p_{i_n}$ for some $n < \omega$. Given such $\sigma$, 
$M(\sigma) := M(\pm p_{i_0}) \cap \ldots \cap M(\pm p_{i_n})$. $\sigma + \sigma'$ will be the union of two such sequences, this is again one such sequence. 

Take an arbitrary model m for $\mathcal{L}$, i.e. a function $m : v(\mathcal{L}) \rightarrow \{t, f\}$. We will use this model as a "strategy", which will tell 
us how to decide, if we have some choice. 

We determine y by an inductive process, essentially cutting away $\mu^+(X)$ around y. We choose $p_i$ or $\neg p_i$ preserving the 
following conditions inductively: For all finite sequences $\sigma$ as above we have: 

(1) $M(\sigma) \cap \mu^+(X) \neq \emptyset$, 

(2) $x \in \mu^-(X \cap M(\sigma))$. 

For didactic reasons, we do the case $p_0$ separately. 

Consider $p_0$. Either $M(p_0) \cap \mu^+(X) \neq \emptyset$, or $M(\neg p_0) \cap \mu^+(X) \neq \emptyset$, or both. If e.g. $M(p_0) \cap \mu^+(X) \neq \emptyset$, but $M(\neg p_0) \cap \mu^+(X) = \emptyset$, 
then we have no choice, and we take $p_0$; in the opposite case, we take $\neg p_0$. E.g. in the first case, $\mu^+(X \cap M(p_0)) = 
\mu^+(X)$, so $x \in \mu^-(X \cap M(p_0))$ by $(\mu^{-}4)$. If both intersections are non-empty, then by $(\mu^{-}5)$ $x \in \mu^-(X \cap M(p_0))$ or 
$x \in \mu^-(X \cap M(\neg p_0))$, or both. Only in the last case, we use our strategy to decide whether to choose $p_0$ or $\neg p_0$: if 
$m(p_0) = t$, we choose $p_0$, if not, we choose $\neg p_0$. 

Obviously, (1) and (2) above are satisfied. 

Suppose we have chosen $p_i$ or $\neg p_i$ for all $i < \alpha$, i.e. defined a partial function from $v(\mathcal{L})$ to $\{t, f\}$, and the induction 
hypotheses (1) and (2) hold. Consider $p_\alpha$. If there is no finite subsequence $\sigma$ of the choices done so far s.t. $M(\sigma) \cap M(p_\alpha) \cap 
\mu^+(X) = \emptyset$, then $p_\alpha$ is a candidate. Likewise for $\neg p_\alpha$. 

One of $p_\alpha$ or $\neg p_\alpha$ is a candidate: 

Suppose not, then there are $\sigma$ and $\sigma'$, subsequences of the choices done so far, with $M(\sigma) \cap M(p_\alpha) \cap \mu^+(X) = \emptyset$ and 
$M(\sigma') \cap M(\neg p_\alpha) \cap \mu^+(X) = \emptyset$. But then $M(\sigma + \sigma') \cap \mu^+(X) = M(\sigma) \cap M(\sigma') \cap \mu^+(X) \subseteq (M(\sigma) \cap M(p_\alpha) \cap \mu^+(X)) \cup 
(M(\sigma') \cap M(\neg p_\alpha) \cap \mu^+(X)) = \emptyset$, contradicting (1) of the induction hypothesis. 

So induction hypothesis (1) will hold again. 

Recall that for each candidate and any $\sigma$ by induction hypothesis (1) $M(\sigma) \cap M(p_\alpha) \cap \mu^+(X) = \mu^+(M(\sigma) \cap M(p_\alpha) \cap X)$ 
by $(\mu=')$, and also for $\sigma \subseteq \sigma'$ $\mu^+(M(\sigma') \cap M(p_\alpha) \cap X) \subseteq \mu^+(M(\sigma) \cap M(p_\alpha) \cap X)$ by $(\mu=')$ and $M(\sigma') \subseteq M(\sigma)$, and thus 
by $(\mu^{-}4)$ $\mu^-(M(\sigma') \cap M(p_\alpha) \cap X) \subseteq \mu^-(M(\sigma) \cap M(p_\alpha) \cap X)$. 

If we have only one candidate left, say e.g. $p_\alpha$, then for each sufficiently big sequence $\sigma$ $M(\sigma) \cap M(\neg p_\alpha) \cap \mu^+(X) = \emptyset$, 
thus for such $\sigma$ $\mu^+(M(\sigma) \cap M(p_\alpha) \cap X) = M(\sigma) \cap M(p_\alpha) \cap \mu^+(X) = M(\sigma) \cap \mu^+(X) = \mu^+(M(\sigma) \cap X)$, and thus by $(\mu^{-}4)$ 
$\mu^-(M(\sigma) \cap M(p_\alpha) \cap X) = \mu^-(M(\sigma) \cap X)$, so $\neg p_\alpha$ plays no really important role. In particular, induction hypothesis (2) 
holds again. 

Suppose now that we have two candidates, thus for $p_\alpha$ and $\neg p_\alpha$ and each $\sigma$ $M(\sigma) \cap M(p_\alpha) \cap \mu^+(X) \neq \emptyset$ and 
$M(\sigma) \cap M(\neg p_\alpha) \cap \mu^+(X) \neq \emptyset$. 

By the same kind of argument as above we see that either for $p_\alpha$ or for $\neg p_\alpha$, or for both, and for all $\sigma$, $x \in \mu^-(M(\sigma) \cap 
M(p_\alpha) \cap X)$ or $x \in \mu^-(M(\sigma) \cap M(\neg p_\alpha) \cap X)$. 

If not, there are $\sigma$ and $\sigma'$ with $x \notin \mu^-(M(\sigma) \cap M(p_\alpha) \cap X) \supseteq \mu^-(M(\sigma + \sigma') \cap M(p_\alpha) \cap X)$ and $x \notin \mu^-(M(\sigma') \cap M(\neg p_\alpha) \cap X) 
\supseteq \mu^-(M(\sigma + \sigma') \cap M(\neg p_\alpha) \cap X)$, but $\mu^-(M(\sigma + \sigma') \cap X) = \mu^-(M(\sigma + \sigma') \cap M(p_\alpha) \cap X) \cup \mu^-(M(\sigma + \sigma') \cap M(\neg p_\alpha) \cap X)$, 
so $x \notin \mu^-(M(\sigma + \sigma') \cap X)$, contradicting the induction hypothesis (2). 

If we can choose both, we let the strategy decide, as for $p_0$. 
So induction hypotheses (1) and (2) will hold again. 
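
In the finite case the inductive choice of literals collapses to a simple loop, which may help to see what the strategy does. The sketch below is an assumption-laden illustration rather than the general construction: models are Python dicts from variable names to booleans, `minimal` lists the models of the minimal set, and `x_in_mu_minus` is a hypothetical oracle reporting whether x belongs to the mu-minus set of X restricted by the literals chosen so far. A literal is admissible when both induction hypotheses survive; the strategy is consulted only when both polarities are admissible.

```python
def agrees(model, sigma):
    """True if the model makes every literal chosen in sigma true."""
    return all(model[p] == value for p, value in sigma.items())


def choose_witness(variables, minimal, x_in_mu_minus, strategy):
    """Pick one polarity per variable, keeping hypotheses (1) and (2) alive."""
    sigma = {}
    for p in variables:
        admissible = []
        for value in (True, False):
            trial = {**sigma, p: value}
            # (1): some minimal model is still compatible with the choices.
            keeps_1 = any(agrees(y, trial) for y in minimal)
            # (2): x is still in the mu-minus set of the restricted X.
            if keeps_1 and x_in_mu_minus(trial):
                admissible.append(value)
        if not admissible:
            raise ValueError("induction hypotheses cannot be preserved")
        # The strategy decides only when there is a real choice.
        sigma[p] = strategy[p] if strategy[p] in admissible else admissible[0]
    return sigma


# Hypothetical example with two variables and three minimal models.
Y1 = {"p": True, "q": True}
Y2 = {"p": True, "q": False}
Y3 = {"p": False, "q": True}
MINIMAL = [Y1, Y2, Y3]
RELATED_TO_X = [Y2, Y3]  # the minimal models that x is sub-related to


def example_oracle(sigma):
    """Finite-case stand-in for hypothesis (2)."""
    return any(agrees(y, sigma) for y in RELATED_TO_X)


WITNESS_T = choose_witness(["p", "q"], MINIMAL, example_oracle, {"p": True, "q": True})
WITNESS_F = choose_witness(["p", "q"], MINIMAL, example_oracle, {"p": False, "q": False})
```

Different strategies reach different witnesses (here the models Y2 and Y3), which is why the construction is later repeated for all strategies.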

This gives a complete description of some y (relative to the strategy!), and we set $x \triangleleft y$. We have to show: for all $Y \in \mathcal{Y}$ 
$x \in \mu^-(Y) \leftrightarrow x \in \mu_\triangleleft(Y) :\leftrightarrow \exists y \in \mu^+(Y).\, x \triangleleft y$. "$\rightarrow$": As we will do above construction for all Y, it suffices to show that 
$y \in \mu^+(X)$. Conversely, if the y constructed above is in $\mu^+(Y)$, then x has to be in $\mu^-(Y)$. 

If $y \notin \mu^+(X)$, then Th(y) is inconsistent with $Th(\mu^+(X))$, as $\mu^+$ is definability preserving, so by classical compactness 
there is a suitable finite sequence $\sigma$ with $M(\sigma) \cap \mu^+(X) = \emptyset$, but this was excluded by the induction hypothesis (1). So 
$y \in \mu^+(X)$. 

Suppose $y \in \mu^+(Y)$, but $x \notin \mu^-(Y)$. So $y \in \mu^+(Y)$ and $y \in \mu^+(X)$, and $Y = M(\phi)$ for some $\phi$, so there will be a suitable 
finite sequence $\sigma$ s.t. for all $\sigma'$ with $\sigma \subseteq \sigma'$ $M(\sigma') \cap X \subseteq M(\phi) = Y$, and by our construction $x \in \mu^-(M(\sigma') \cap X)$. As 
$y \in \mu^+(X) \cap \mu^+(Y) \cap (M(\sigma') \cap X)$, $\mu^+(M(\sigma') \cap X) \subseteq \mu^+(Y)$, so by $(\mu^{-}4)$ $\mu^-(M(\sigma') \cap X) \subseteq \mu^-(Y)$, so $x \in \mu^-(Y)$, 
contradiction. 

We do now this construction for all strategies. Obviously, this does not modify our results. 
This finishes the completeness proof. □ 

As we postulated definability preservation, there are no problems to translate the result into logic. (Note that $\nu$ was 
applied to formula defined model sets, but the resulting sets were perhaps theory defined model sets.) 

Comment: 

One might try a construction similar to the one for Counterfactual Conditionals, see [SM94], and try to patch together 
several ranked structures, one for each K on the left, to obtain a general distance, by repeating elements. 

So we would have different "copies" of A, say $A_i$, more precisely of its elements, and the natural definition seems to be: 
$A * \phi \vdash \psi$ iff for all i $A_i * \phi \vdash \psi$, so $A \mid B = \bigcup\{A_i \mid B : i \in I\}$. 

But this does not work: Take A := {a, a', a''}, B := {b, b'}, with $A \mid B$ := {b, b'}, and $a \mid B = a' \mid B = a'' \mid B$ = {b}. Then 
for all copies of the singletons, the result cannot be empty, but must be {b}. But $A \mid B$ can only be a "partial" union of 
the $x \mid B$, $x \in A$, so it must be {b} for all copies of A, contradiction. 

(Alternative definitions with copies fail too, but no systematic investigation was done.) 

Chapter 9 



An analysis of defeasible inheritance systems 

9.1 Introduction 
9.1.1 Terminology 

"Inheritance" will stand here for "nonmonotonic or defeasible inheritance" . We will use indiscriminately "inheritance 
system" , "inheritance diagram" , "inheritance network" , "inheritance net" . 

In this introduction, we first give the connection to reactive diagrams, then give the motivation, then describe in very brief 
terms some problems of inheritance diagrams, and mention the basic ideas of our analysis. 



9.1.2 Inheritance and reactive diagrams 

Inheritance systems or diagrams have an intuitive appeal. They seem close to human reasoning, natural, and are also 
implemented (see [Mor98]). Yet, they are a more procedural approach to nonmonotonic reasoning, and, to the authors' 
knowledge, a conceptual analysis, leading to a formal semantics, as well as a comparison to more logic based formalisms 
like the systems P and R of preferential systems are lacking. We attempt to reduce the gap between the more procedural 
and the more analytical approaches in this particular case. This will also give indications how to modify the systems P 
and R to bring them closer to actual human reasoning. Moreover, we establish a link to multi-valued logics and the 
logics of information sources (see e.g. [ABK07] and forthcoming work of the same authors, and also [BGH95]). 

An inheritance net is a directed graph with two types of connections between nodes, x → y and x ↛ y. Diagram 9.2.2 
(page 175) is such an example. The meaning of x → y is that x is also a y and the meaning of x ↛ y is that x is not a y. 
We do not allow the combinations x ↛ y ↛ z or x ↛ y → z but we do allow x → y → z and x → y ↛ z. 

Given a complex diagram such as Diagram 9.2.4 (page 181) and two points say z and y, the question we ask is to determine 
from the diagram whether the diagram says that 

(1) z is y 

(2) z is not y 

(3) nothing to say. 

Since in Diagram 9.2.4 (page 181) there are paths to y from z either through x or through v, we need to have an algorithm 
to decide. Let A be such an algorithm. 

We need A to decide 

(1) are there valid paths from z to y 

(2) of the opposing paths (one which supports 'z is y' and one which supports 'z is not y'), which one wins (usually 
winning makes use of being more specific but there are other possible options). 

So for example, in Diagram 9.2.4 (page 181), the connection x → v makes paths through x more specific than paths through 
v. The question is whether we have a valid path from z to x. 

In the literature, as well as in this paper, there are algorithms for deciding the valid paths and the relative specificity of 
paths. These are complex inductive algorithms, which may need the help of a computer for the case of the more complex 
diagrams. 

It seems that for inheritance networks we cannot adopt a simple minded approach and just try to 'walk' on the graph from 
z to y, and depending on what happens during this 'walk' decide whether z is y or not. 

To explain what we mean, suppose we give the network a different meaning, that of fluid flow: x → y means there is an 
open pipe from x to y and x ↛ y means there is a blocked pipe from x to y. 
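
Under the fluid-flow reading, the decision really is a plain graph walk. The Python sketch below illustrates this with ordinary breadth-first reachability. The arrow list is an assumption: it contains only those arrows of Diagram 9.2.4 that are mentioned in this section, and the actual diagram may contain more.

```python
from collections import deque


def fluid_reaches(open_pipes, source, target):
    """Breadth-first search along open pipes only; blocked pipes are ignored."""
    seen = {source}
    queue = deque([source])
    while queue:
        node = queue.popleft()
        if node == target:
            return True
        for u, v in open_pipes:
            if u == node and v not in seen:
                seen.add(v)
                queue.append(v)
    return False


# Assumed fragment of Diagram 9.2.4 (only arrows named in the text).
OPEN_PIPES = [("z", "u"), ("u", "x"), ("u", "v"), ("x", "v"), ("v", "y")]
BLOCKED_PIPES = [("x", "y"), ("z", "x")]  # play no role for fluid flow
```

Nothing in this walk depends on specificity or on which other paths exist, which is exactly what distinguishes it from the inheritance reading.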

To the question 'can fluid flow from z to y in Diagram 9.2.4 (page 181)?', there is a simple answer: 

Similarly we can ask in the inheritance network something like (*) below: 



(*) z is (resp. is not) y according to diagram D, iff there is a path π from z to y in D such that some 
non-inductive condition ψ(π) holds for the path π. 

Can we offer the reader such a ψ? 

If we do want to help the user to 'walk' the graph and get an answer, we can proceed as one of the following options: 

Option 1. Add additional annotations to paths to obtain D* from D, so that a predicate ψ can be defined on D* using these 
annotations. Of course these annotations will be computed using the inductive algorithm in A, i.e. we modify A to 
A* which also executes the annotations. 

Option 2. Find a transformation τ on diagrams D to transform D to D' = τ(D), such that a predicate ψ can be found for D'. 
So we work on D' instead of on D. 

We require a compatibility condition on options 1 and 2: 
(C1) If we apply A* to D* we get D* again. 
(C2) τ(τ(D)) = τ(D). 

We now present the tools we use for our annotations and transformation. These are the reactive double arrows. 
Consider the following Diagram 9.1.1 (page 170): 



Reactive graph 




Diagram 9.1.1 



We want to walk from a to e. If we go to c, a double arrow from the arc a → c blocks the way from d to e. So the only 
way to go to e is through b. If we start at a' there is no such block. It is the travelling through the arc (a, c) that triggers 
the double arrow (a, c) ↠ (d, e). 
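
The effect of a double arrow can be made concrete with a walk that carries a set of switched-off arcs along each path. The following Python sketch is illustrative only; since the picture of Diagram 9.1.1 is not reproduced here, the arc list is a hypothetical reconstruction chosen to match the description (two routes from a, one of which is blocked, and an unblocked route from a').

```python
def open_paths(arcs, double_arrows, start, goal):
    """Enumerate paths from start to goal; traversing an arc may block others.

    `double_arrows` is a list of pairs (trigger_arc, blocked_arc). The blocked
    set is local to each path, so it vanishes when the starting point changes.
    The graph is assumed to be acyclic.
    """
    found = []

    def walk(node, path, blocked):
        if node == goal:
            found.append(path)
            return
        for arc in arcs:
            u, v = arc
            if u == node and arc not in blocked:
                triggered = {target for trigger, target in double_arrows if trigger == arc}
                walk(v, path + [v], blocked | triggered)

    walk(start, [start], frozenset())
    return found


# Hypothetical reconstruction of the reactive graph described above.
ARCS = [("a", "b"), ("a", "c"), ("b", "d"), ("c", "d"), ("d", "e"), ("a'", "c")]
DOUBLE_ARROWS = [(("a", "c"), ("d", "e"))]
```

With these assumed arcs, the only open path from a to e runs through b, while from a' the route through c stays open because the triggering arc (a, c) is never travelled.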

We want to use ↠ in A* and in τ. 

So in Diagram 9.2.4 (page 181) the path u → x ↛ y is winning over the path u → v → y, because of the specificity arrow 
x → v. However, if we start at z then the path z → u → v → y is valid because of z ↛ x. We can thus add the following 
double arrows to the diagram. We get Diagram 9.1.2 (page 170). 

Diagram 9.1.2 



If we start from u, and go to u → v, then v → y is cancelled. Similarly u → x → v cancels v → y. So the only path is 
u → x ↛ y. 

If we start from z then u → x is cancelled and so is the cancellation (u, v) ↠ (v, y). Hence the path z → u → v → y is 
open. 

We are not saying that τ(Diagram 9.2.4) = Diagram 9.1.2, but something effectively similar will be done by τ. 

We emphasize that the construction depends on the point of departure. Consider Diagram 9.2.4 (page 181). Starting at 
u, we will have to block the path uvy. Starting at z, the path zuvy has to be free. See Diagram 9.1.3 (page 171). 



The problem of downward chaining - reactive 

Diagram 9.1.3 

So we cannot just add a double arrow from u — > v to v — > y, blocking v — > y, and leave it there when we start from z. We 
will have to erase it when we change the origin. 

At the same time, this shows an advantage over just erasing the arrow v → y: 

When we change the starting point, we can erase simply all double arrows, and do not have to remember the original 
diagram. 

First, if all possible paths are also valid, there is nothing to do. (At the same time, this shows that applying the procedure 
twice will not result in anything new.) 

Second, remember that we have an upward chaining formalism. So if a potential path fails to be valid, it will do so at the 
end. 

Third, suppose that we have two valid paths σ : x → y and τ : x → y. 

If they are negative (and they are either both negative or both positive, of course), then they cannot be continued. So if there 
is an arrow y → z or y ↛ z, we will block it by a double arrow from the first arrow of σ and from the first arrow of τ to 
y → z (y ↛ z respectively). 

If they are positive, and there is an arrow y → z or y ↛ z, both σ continued by y → z and τ continued by y → z are potential paths (the case y ↛ z 
is analogous). One is valid iff the other one is, as σ and τ have the same endpoint, so preclusion, if present, acts on both 
the same way. If they are not valid, we block y → z by a double arrow from the first arrow of σ and from the first arrow 
of τ to y → z. 

Of course, if there is only one such σ, we do the same; there is then just one path to consider in the justification. 
We summarize: 

Our algorithm switches impossible continuations off by making them invisible, there is just no more arrow to concatenate. 
As validity in inheritance networks is not forward looking - validity of a : x — > y does not depend on what is beyond y - 
validity in the old and in the new network starting at x are the same. As we left only valid paths, applying the algorithm 
twice will not give anything new. 

We illustrate this by considering Diagram 9.3.1 (page 184). 

First, we add double arrows for starting point c, see Diagram 9.1.4 (page 172). 

Diagram 9.1.4 

and then for starting point x, see Diagram 9.1.5 (page 173). 

Diagram 9.1.5 



For more information on reactive diagrams, see also [Gab08b], and an earlier version: [Gab04]. 



9.1.3 Conceptual analysis 

Inheritance diagrams are deceptively simple. Their conceptually complicated nature is seen by e.g. the fundamental 
difference between direct links and valid paths, and the multitude of existing formalisms, upward vs. downward chaining, 
intersection of extensions vs. direct scepticism, on-path vs. off-path preclusion (or pre-emption), split validity vs. total 
validity preclusion etc., to name a few, see the discussion in Section 9.2.3 (page 181). Such a proliferation of formalisms 
usually hints at deeper problems on the conceptual side, i.e. that the underlying ideas are ambiguous, and not sufficiently 
analysed. Therefore, any clarification and resulting reduction of possible formalisms seems a priori to make progress. Such 
clarification will involve conceptual decisions, which need not be shared by all, they can only be suggestions. Of course, a 
proof that such decisions are correct is impossible, and so is its contrary. 

We will introduce into the analysis of inheritance systems a number of concepts not usually found in the field, like multiple 
truth values, access to information, comparison of truth values, etc. We think that this additional conceptual burden pays 
off by a better comprehension and analysis of the problems behind the surface of inheritance. 

We will also see that some distinctions between inheritance formalisms go far beyond questions of inheritance, and concern 
general problems of treating contradictory information - isolating some of these is another objective of this article. 

The text is essentially self-contained, still some familiarity with the basic concepts of inheritance systems and nonmonotonic 
logics in general is helpful. For a presentation, the reader might look into [Sch97-2] and [Sch04]. 

The text is organized as follows. After an introduction to inheritance theory, connections with reactive diagrams in Section 
9.3 (page 182), and big and small subsets and the systems P and R in Section 9.2 (page 174), we turn to an informal 
description of the fundamental differences between inheritance and the systems P and R in Section 9.4.2 (page 185), give 
an analysis of inheritance systems in terms of information and information flow in Section 9.4.3 (page 186), then in terms 
of reasoning with prototypes in Section 9.4.4 (page 188), and conclude in Section 9.5 (page 190) with a translation of 
inheritance into (necessarily deeply modified) coherent systems of big and small sets, respectively logical systems P and R. 

9.2 Introduction to nonmonotonic inheritance 



9.2.1 Basic discussion 

We give here an informal discussion. The reader unfamiliar with inheritance systems should consult in parallel Definition 
9.2.3 (page 178) and Definition 9.2.4 (page 178). As there are many variants of the definitions, it seems reasonable to 
discuss them before a formal introduction, which, otherwise, would seem to pretend to be definite without being so. 

9.2.1.0.3 (Defeasible or nonmonotonic) inheritance networks or diagrams 

Nonmonotonic inheritance systems describe situations like "normally, birds fly", written birds → fly. Exceptions are 
permitted, "normally penguins don't fly", penguins ↛ fly. 

Definition 9.2.1 

A nonmonotonic inheritance net is a finite DAG, directed, acyclic graph, with two types of arrows or links, → and ↛, and 
labelled nodes. We will use Γ etc. for such graphs, and σ etc. for paths - the latter to be defined below. 

Roughly (and to be made precise and modified below, we try to give here just a first intuition), X → Y means that 
"normal" elements of X are in Y, and X ↛ Y means that "normal" elements of X are not in Y. In a semi-quantitative 
set interpretation, we will read "most" for "normal", thus "most elements of X are in Y", "most elements of X are not in 
Y", etc. These are by no means the only interpretations, as we will see - we will use these expressions for the moment just 
to help the reader's intuition. We should add immediately a word of warning: "most" is here not necessarily, but only by 
default, transitive, in the following sense. In the Tweety diagram, see Diagram 9.2.1 (page 174) below, most penguins are 
birds, most birds fly, but it is not the case that most penguins fly. This is the problem of transfer of relative size which 
will be discussed extensively, especially in Section 9.5 (page 190). 

According to the set interpretation, we will also use informally expressions like X ∩ Y, X − Y, CX - where C stands for 
set complement -, etc. But we will also use nodes informally as formulas, like X ∧ Y, X ∧ ¬Y, ¬X, etc. All this will only 
be used here as an appeal to intuition. 

Nodes at the beginning of an arrow can also stand for individuals, so Tweety ↛ fly means something like: "normally, 
Tweety will not fly". As always in nonmonotonic systems, exceptions are permitted, so the soft rules "birds fly", "penguins 
don't fly", and (the hard rule) "penguins are birds" can coexist in one diagram, penguins are then abnormal birds (with 
respect to flying). The direct link penguins ↛ fly will thus be accepted, or considered valid, but not the composite path 
penguins → birds → fly, by specificity - see below. This is illustrated by Diagram 9.2.1 (page 174), where a stands for 
Tweety, c for penguins, b for birds, d for flying animals or objects. 

(Remark: The arrows a → c, a → b, and c → b can also be composite paths - see below for the details.) 



The Tweety diagram 




Diagram 9.2.1 



(Of course, there is an analogous case for the opposite polarity, i.e. when the arrow from b to d is negative, and the one 
from c to d is positive.) 

We will write Γ ⊨ σ, if σ is a valid path in the network Γ, and if x is the origin, and y the endpoint of σ, and σ is positive, 
we will write Γ ⊨ xy, i.e. we will accept the conclusion that x's are y's, and analogously Γ ⊨ x$\overline{y}$ for negative paths. Note 
that we will not accept any other conclusions, only those established by a valid path, so many questions about conclusions 
have a trivial negative answer: there is obviously no path from x to y. E.g., there is no path from b to c in Diagram 9.2.1 
(page 174). Likewise, there are no disjunctions, conjunctions etc. in our conclusions, and negation is present only in a 
strong form: "it is not the case that x's are normally y's" is not a possible conclusion, only "x's are normally not y's" is 
one. Also, possible contradictions are contained, there is no EFQ. 

To simplify matters, we assume that for no two nodes x, y ∈ Γ, x → y and x ↛ y are both in Γ, intuitively, that Γ is free 
from (hard) contradictions. This restriction is inessential for our purposes. We admit, however, soft contradictions, and 
preclusion, which allows us to solve some soft contradictions - as we already did in the penguins example. We will also 
assume that all arrows stand for rules with possible exceptions, again, this restriction is not important for our purposes. 
Moreover, in the abstract treatment, we will assume that all nodes stand for (nonempty) sets, though this will not be true 
for all examples discussed. 

This might be the place for a remark on absence of cycles. Suppose we also have a positive arrow from b to c in Diagram 
9.2.1 (page 174). Then, the concept of preclusion collapses, as there are now equivalent arguments to accept a → b → d 
and a → c ↛ d. Thus, if we do not want to introduce new complications, we cannot rely on preclusion to decide conflicts. 
It seems that this would change the whole outlook on such diagrams. The interested reader will find more on the subject 
in [Ant97], [Ant99], [Ant05]. 

Inheritance networks were introduced about 20 years ago (see e.g. [Tou84], [Tou86], [THT87]), and exist in a multitude of 
more or less differing formalisms, see e.g. [Sch97-2] for a brief discussion. There still does not seem to exist a satisfying 
semantics for these networks. The authors' own attempt [Sch90] is an a posteriori semantics, which cannot explain or 
criticise or decide between the different formalisms. We will give here a conceptual analysis, which provides also at least 
some building blocks for a semantics, and a translation into (a modified version of) the language of small and big subsets, 
familiar from preferential structures, see Definition 3.2.2.6. 

We will now discuss the two fundamental situations of contradictions, then give a detailed inductive definition of valid paths 
for a certain formalism so the reader has firm ground under his feet, and then present briefly some alternative formalisms. 

As in all of nonmonotonic reasoning, the interesting questions arise in the treatment of contradictions and exceptions. The 
difference in quality of information is expressed by "preclusion" (or "pre-emption"). The basic diagram is the Tweety 
diagram, see Diagram 9.2.1 (page 174). 

Unresolved contradictions give rise either to a branching into different extensions, which may roughly be seen as maximal 
consistent subsets, or to mutual cancellation in directly sceptical approaches. The basic diagram for the latter is the Nixon 
Diamond, see Diagram 9.2.2 (page 175), where a = Nixon, b = Quaker, c = Republican, d = pacifist. 

In the directly sceptical approach, we will not accept any path from a to d as valid, as there is an unresolvable contradiction 
between the two candidates. 



The Nixon Diamond 




Diagram 9.2.2 



The extensions approach can be turned into an indirectly sceptical one, by forming first all extensions, and then taking the 
intersection of either the sets of valid paths, or of valid conclusions, see [MS91] for a detailed discussion. See also Section 
9.2.3 (page 181) for more discussion on directly vs. indirectly sceptical approaches. 

9.2.1.0.4 Preclusion 

In the above example, our intuition tells us that it is not admissible to conclude from the fact that penguins are birds, 
and that most birds fly, that most penguins fly. The horizontal arrow c → b together with c ↛ d bars this conclusion, it 
expresses specificity. Consequently, we have to define the conditions under which two potential paths neutralize each other, 
and when one is victorious. The idea is as follows: 1) We want to be sceptical, in the sense that we do not believe every 
potential path. We will not arbitrarily choose one either. 2) Our scepticism will be restricted, in the sense that we will 
often make well defined choices for one path in the case of conflict: a) If a compound potential path is in conflict with a 
direct link, the direct link wins, b) Two conflicting paths of the same type neutralize each other, as in the Nixon Diamond, 
where neither potential path will be valid, c) More specific information will win over less specific information. 

(It is essential in the Tweety diagram that the arrow c ↛ d is a direct link, so it is in a way stronger than compound 
paths.) The arrows a → b, a → c, c → b can also be composite paths: The path from c to b (read c ⊂ ... ⊂ b, where ⊂ 
stands here for soft inclusion), however, tells us that the information coming from c is more specific (and thus considered 
more reliable), so the negative path from a to d via c will win over the positive one via b. The precise inductive definition 
will be given below. This concept is evidently independent of the length of the paths, a ⋯ → c may be much longer than 
a ⋯ → b, so this is not shortest path reasoning (which has some nasty drawbacks, discussed e.g. in [HTT87]). 

A final remark: Obviously, in some cases, it need not be specificity, which decides conflicts. Consider the case where 
Tweety is a bird, but a dead animal. Obviously, Tweety will not fly, here because the predicate "dead" is very strong and 
overrules many normal properties. When we generalize this, we might have a hierarchy of causes, where one overrules the 
other, or the result may be undecided. For instance, a falling object might be attracted in a magnetic field, but a gusty 
wind might prevent this, sometimes, with unpredictable results. This is then additional information (strength of cause), 
and this problem is not addressed directly in traditional inheritance networks: we would have to introduce a subclass "dead 
bird". Subclasses often have properties of "pseudo-causes": being a penguin probably is not a "cause" for not flying, 
nor being a bird for flying; still, things change from class to subclass for a reason. 

Before we give a formalism based on these ideas, we refine them, adopt one possibility (but indicate some modifications), 
and discuss alternatives later. 



9.2.2 Directly sceptical split validity upward chaining off-path inheritance 

Our approach will be directly sceptical, i.e. unsolvable contradictions result in the absence of valid paths; it is upward 
chaining, and uses split validity for preclusions (discussed below, in particular in Section 9.2.3 (page 181)). We will indicate 
modifications to make it extension based, as well as for total validity preclusion. This approach is strongly inspired by 
classical work in the field by Horty, Thomason, Touretzky, and others, and we claim no priority whatever. If it is new at 
all, it is a very minor modification of existing formalisms. 

Our conceptual ideas, to be presented in detail in Section 9.4.3 (page 186), make split validity, off-path preclusion and 
upward chaining a natural choice. 

For the reader's convenience, we give here a very short résumé of these ideas: We consider only arrows as information, e.g. 
a → b will be considered information b valid at or for a. Valid composed positive paths will not be considered information 
in our sense. They will be seen as a way to obtain information, so a valid path σ : x … → a makes information b accessible 
to x, and, secondly, as a means of comparing information strength, so a valid path σ : a … → a′ will make information 
at a stronger than information at a′. Valid negative paths have no function; we will only consider the positive initial part 
as discussed above, and the negative end arrow as information, but never the whole path. 

Choosing direct scepticism is a decision beyond the scope of this article, and we just make it. It is a general question how 
to treat contradictory and absent information, and if they are equivalent or not, see the remark in Section 9.4.4 (page 188). 
(The fundamental difference between intersection of extensions and direct scepticism for defeasible inheritance was shown 
in [Sch93].) See also Section 9.2.3 (page 181) for more discussion. 

We turn now to the announced variants as well as a finer distinction within the directly sceptical approach. Again, see 
also Section 9.2.3 (page 181) for more discussion. 

Our approach generates another problem, essentially that of the treatment of a mixture of contradictory and concordant 
information of multiple strengths or truth values. We bundle the decision of this problem with that for direct scepticism 
into a "plug-in" decision, which will be used in three approaches: the conceptual ideas, the inheritance algorithm, and the 
choice of the reference class for subset size (and implicitly also for the treatment as a prototype theory). It is thus well 
encapsulated, and independent from the context. 

These decisions (but, perhaps to a lesser degree, (1)) concern a wider subject than only inheritance networks. Thus, it 
is not surprising that there are different formalisms for solving such networks, deciding one way or the other. But this 
multitude is not the fault of inheritance theory, it is only a symptom of a deeper question. We first give an overview for a 
clearer overall picture, and discuss them in detail below, as they involve sometimes quite subtle questions. 

(1) Upward chaining against downward or double chaining. 

(2.1) Off-path against on-path preclusion. 

(2.2) Split validity preclusion against total validity preclusion. 

(3) Direct scepticism against intersection of extensions. 

(4) Treatment of mixed contradiction and preclusion situations, no preclusion by paths of the same polarity. 

(1) When we interpret arrows as causation (in the sense that X → Y expresses that condition X usually causes condition Y 
to result), this can also be seen as a difference in reasoning from cause to effect vs. backward reasoning, looking for causes 
for an effect. (A word of warning: There is a well-known article [SL89] from which a superficial reader might conclude that 
upward chaining is tractable, and downward chaining is not. A more careful reading reveals that, on the negative side, 
the authors only show that double chaining is not tractable.) We will adopt upward chaining in all our approaches. See 
Section 9.4.4 (page 188) for more remarks. 
absolute comparison of truth values, independent of reachability of information. Thus, in Diagram 9.2.1 (page 174), the 
comparison between the truth values "penguin" and "bird" is absolute, and does not depend on the point of view "Tweety", 
as it can in total validity preclusion - if we continue to view preclusion as a comparison of information strength (or truth 
value). This question of absoluteness obviously transcends inheritance networks. Our decision is, of course, again uniform 
for all our approaches. 

(3) This point, too, is much more general than the problems of inheritance. It is, among other things, a question of whether 
only the two possible cases (positive and negative) may hold, or whether there might be still other possibilities. See Section 
9.4.4 (page 188). 

(4) This concerns the treatment of truth values in more complicated situations, where we have a mixture of agreeing and 
contradictory information. Again, this problem reaches far beyond inheritance networks. 

We will group (3) and (4) together in one general, "plug-in" decision, to be found in all approaches we discuss. 
Definition 9.2.2 

This is an informal definition of a plug-in decision: 

We describe now more precisely a situation which we will meet in all contexts discussed, and whose decision goes beyond 
our problem - thus, we have to adopt one or several alternatives, and translate them into the approaches we will discuss. 
There will be one global decision, which is (and can be) adapted to the different contexts. 

Suppose we have information about φ and ψ, where φ and ψ are presumed to be independent - in some adequate sense. 
Suppose then that we have information sources A_i : i ∈ I and B_j : j ∈ J, where the A_i speak about φ (they say φ or ¬φ), 
and the B_j speak about ψ in the same way. Suppose further that we have a partial, not necessarily transitive (!), ordering 
< on the information sources A_i and B_j together. X < Y will say that X is better (intuition: more specific) than Y. (The 
potential lack of transitivity is crucial, as valid paths do not always concatenate to valid paths - just consider the Tweety 
diagram.) 

We also assume that there are contradictions, i.e. some A_i say φ, some ¬φ, likewise for the B_j - otherwise, there are no 
problems in our context. 

We can now take several approaches, all taking contradictions and the order < into account. 

• (P1) We use the global relation <, and throw away all information coming from sources of minor quality, i.e. if there 
is X such that X < Y, then no information coming from Y will be taken into account. Consequently, if Y is the only 
source of information about φ, then we will have no information about φ. This seems an overly radical approach, as 
one source might be better for φ, but not necessarily for ψ, too. 

If we adopt this radical approach, we can continue as below, and can even split in analogous ways into (P1.1) and 
(P1.2), as we do below for (P2.1) and (P2.2). 

• (P2) We consider the information about φ separately from the information about ψ. Thus, we consider for φ only 
the A_i, for ψ only the B_j. Take now e.g. φ and the A_i. Again, there are (at least) two alternatives. 

- (P2.1) We eliminate again all sources among the A_i for which there is a better A_{i′}, irrespective of whether they 
agree on φ or not. 

* (a) If the sources left are contradictory, we conclude nothing about φ, and accept for φ none of the sources. 
(This is a directly sceptical approach of treating unsolvable contradictions, following our general strategy.) 

* (b) If the sources left agree for φ, i.e. all say φ, or all say ¬φ, then we conclude φ (or ¬φ), and accept for φ 
all the remaining sources. 

- (P2.2) We eliminate again all sources among the A_i for which there is a better A_{i′}, but only if A_i and A_{i′} have 
contradictory information. Thus, more sources may survive than in approach (P2.1). 

We now continue as for (P2.1): 

* (a) If the sources left are contradictory, we conclude nothing about φ, and accept for φ none of the sources. 

* (b) If the sources left agree for φ, i.e. all say φ, or all say ¬φ, then we conclude φ (or ¬φ), and accept for φ 
all the remaining sources. 

The difference between (P2.1) and (P2.2) is illustrated by the following simple example. Let A < A′ < A″, but A ≮ A″ 
(recall that < is not necessarily transitive), and A ⊨ φ, A′ ⊨ ¬φ, A″ ⊨ ¬φ. Then (P2.1) decides for φ (A is the only 
survivor), (P2.2) does not decide, as A and A″ are contradictory, and both survive in (P2.2). 
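
The example can be checked mechanically. The following is only a minimal sketch under stated assumptions: the function name `decide`, the encoding of sources as a dictionary, and the encoding of < as a set of pairs are ours, not part of the text. A source is eliminated if some source is better than it, and, for (P2.2), only if the two also contradict each other.

```python
from typing import Dict, Optional, Set, Tuple


def decide(
    says: Dict[str, bool],
    better: Set[Tuple[str, str]],
    only_if_contradictory: bool,
) -> Optional[bool]:
    """Return True (phi), False (not phi), or None (no decision).

    says[s] is True if source s says phi, False if it says not phi.
    (x, y) in better encodes x < y, i.e. x is better than y.
    The relation is NOT assumed to be transitive.
    only_if_contradictory=False gives (P2.1), True gives (P2.2).
    """
    survivors = [
        y
        for y in says
        if not any(
            (x, y) in better
            and (not only_if_contradictory or says[x] != says[y])
            for x in says
        )
    ]
    verdicts = {says[s] for s in survivors}
    # Agreement among the survivors decides; anything else is left undecided.
    return verdicts.pop() if len(verdicts) == 1 else None


# A < A' < A'', but not A < A''; A says phi, A' and A'' say not phi.
says = {"A": True, "A1": False, "A2": False}
better = {("A", "A1"), ("A1", "A2")}

p21 = decide(says, better, only_if_contradictory=False)
p22 = decide(says, better, only_if_contradictory=True)
print(p21, p22)
```

Under (P2.1) only A survives, so φ is concluded; under (P2.2) both A and A″ survive and contradict each other, so nothing is concluded.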

There are arguments for and against either solution: (P2.1) gives a uniform picture, more independent from <p, (P2.2) gives 
more weight to independent sources, it "adds" information sources, and thus gives potentially more weight to information 
from several sources. (P2.2) seems more in the tradition of inheritance networks, so we will consider it in the further 
development. 

The reader should note that our approach is quite far from a fixed point approach in two ways: First, fixed point approaches 
seem more appropriate for extensions-based approaches, as both try to collect a maximal set of uncontradictory information. 
Second, we eliminate information when there is better, contradicting information, even if the final result agrees with the 
first. This, too, contradicts in spirit the fixed point approach. 

After these preparations, we turn to a formal definition of validity of paths. 



9.2.2.0.5 The definition of ⊨ (i.e. of validity of paths) 




just a set of points and arrows, thus e.g. x → y ∈ Γ and x ∈ Γ are defined, when x is a point in Γ, and x → y an arrow in 
Γ. Recall that we have two types of arrows, positive and negative ones. 

We first define generalized and potential paths, then the notion of degree, and finally validity of paths, written Γ ⊨ σ, if 
σ is a path, as well as Γ ⊨ xy, if Γ ⊨ σ and σ : x … → y. 

Definition 9.2.3 

(1) Generalized paths: 

A generalized path is an uninterrupted chain of positive or negative arrows pointing in the same direction, more precisely: 
x → p ∈ Γ ⇒ x → p is a generalized path, 
x ↛ p ∈ Γ ⇒ x ↛ p is a generalized path. 

If x ⋯ → p is a generalized path, and p → q ∈ Γ, then x ⋯ → p → q is a generalized path, 
if x ⋯ → p is a generalized path, and p ↛ q ∈ Γ, then x ⋯ → p ↛ q is a generalized path. 

(2) Concatenation: 

If σ and τ are two generalized paths, and the end point of σ is the same as the starting point of τ, then σ ∘ τ is the 
concatenation of σ and τ. 

(3) Potential paths (pp.): 

A generalized path, which contains at most one negative arrow, and this at the end, is a potential path. If the last link is 
positive, it is a positive potential path, if not, a negative one. 

(4) Degree: 

As already indicated, we shall define paths inductively. As we do not admit cycles in our systems, the arrows define a 
well-founded relation on the vertices. Instead of using this relation for the induction, we shall first define the auxiliary 
notion of degree, and do induction on the degree. Given a node x (the origin), we need a (partial) mapping f from the 
vertices to natural numbers such that p → q or p ↛ q ∈ Γ implies f(p) < f(q), and define (relative to x): 

Let σ be a generalized path from x to y, then deg_{Γ,x}(σ) := deg_{Γ,x}(y) := the maximal length of any generalized path parallel 
to σ, i.e. beginning in x and ending in y. 
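
As an illustration of the degree, here is a small sketch that computes, for a fixed origin x, the length of the longest chain of arrows from x to every node of an acyclic net. It rests on assumptions of ours: the name `degree_from` is hypothetical, arrows are given as (source, target) pairs, and polarity is ignored, so every chain of arrows is counted as a generalized path. Nodes that cannot be reached from x get no degree (the mapping is partial).

```python
from typing import Dict, Iterable, List, Optional, Tuple


def degree_from(arrows: Iterable[Tuple[str, str]], x: str) -> Dict[str, Optional[int]]:
    """Longest-path length from x to each node of an acyclic net.

    Returns None for nodes that are not reachable from x.
    """
    preds: Dict[str, List[str]] = {}
    nodes = {x}
    for p, q in arrows:
        preds.setdefault(q, []).append(p)
        nodes.update((p, q))

    memo: Dict[str, Optional[int]] = {x: 0}  # the origin itself has degree 0

    def longest(y: str) -> Optional[int]:
        if y not in memo:
            best: Optional[int] = None
            for p in preds.get(y, []):
                d = longest(p)  # terminates because the net has no cycles
                if d is not None and (best is None or d + 1 > best):
                    best = d + 1
            memo[y] = best
        return memo[y]

    return {y: longest(y) for y in nodes}


# Tweety diagram: a -> b, a -> c, c -> b, b -> d, and the negative arrow from c to d.
tweety_arrows = [("a", "b"), ("a", "c"), ("c", "b"), ("b", "d"), ("c", "d")]
deg = degree_from(tweety_arrows, "a")
print(deg)
```

Since every arrow p → q or p ↛ q forces the degree of q to exceed that of p, the resulting mapping can serve as the function f required above.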

Definition 9.2.4 

Inductive definition of Γ ⊨ σ: 
Let σ be a potential path. 

• Case I: 

σ is a direct link in Γ. Then Γ ⊨ σ. 

(Recall that we have no hard contradictions in Γ.) 

• Case II: 

σ is a compound potential path, deg_{Γ,x}(σ) = n, and Γ ⊨ τ is defined for all τ with degree less than n - whatever 
their origin and endpoint. 

• Case II.1: 

Let σ be a positive pp. x ⋯ → u → y, let σ′ := x ⋯ → u, so σ = σ′ ∘ u → y. 
Then, informally, Γ ⊨ σ iff 

(1) σ is a candidate by upward chaining, 

(2) σ is not precluded by more specific contradicting information, 

(3) all potential contradictions are themselves precluded by information contradicting them. 

Note that (2) and (3) are the translation of (P2.2) in Definition 9.2.2 (page 177). 
Formally, Γ ⊨ σ iff 

(1) Γ ⊨ σ′ and u → y ∈ Γ. 

(The initial segment must be a path, as we have an upward chaining approach. This is decided by the induction 
hypothesis.) 

(2) There are no v, τ, τ′ such that v ↛ y ∈ Γ and Γ ⊨ τ := x ⋯ → v and Γ ⊨ τ′ := v ⋯ → u. (τ may be the 
empty path, i.e. x = v.) 

(σ itself is not precluded by split validity preclusion and a contradictory link. Note that τ ∘ v ↛ y need not be 
valid, it suffices that it is a better candidate (by τ′).) 

(3) All potentially conflicting paths are precluded by information contradicting them: 

For all v and τ such that v ↛ y ∈ Γ and Γ ⊨ τ := x ⋯ → v (i.e. for all potentially conflicting paths τ ∘ v ↛ y) 
there is z such that z → y ∈ Γ and either 

z = x 

(the potentially conflicting pp. is itself precluded by a direct link, which is thus valid) 
or 
• Case II.2: The negative case, i.e. σ a negative pp. x ⋯ → u ↛ y, σ′ := x ⋯ → u, σ = σ′ ∘ u ↛ y, is entirely 
symmetrical. 



Remark 9.2.1 

The following remarks all concern preclusion. 

(1) Thus, in the case of preclusion, there is a valid path from x to z, and z is more specific than v, so τ ∘ v ↛ y is precluded. 
Again, ρ ∘ z → y need not be a valid path, but it is a better candidate than τ ∘ v ↛ y is, and as τ ∘ v ↛ y is in simple 
contradiction, this suffices. 

(2) Our definition is stricter than many popular ones, in the following sense: We require - according to our general picture 
to treat only direct links as information - that the preclusion "hits" the precluded path at the end, i.e. v ↛ y ∈ Γ, and ρ′ 
hits τ ∘ v ↛ y at v. In other definitions, it is possible that the preclusion hits at some v′, which is somewhere on the path 
τ, and not necessarily at its end. For instance, in the Tweety Diagram, see Diagram 9.2.1 (page 174), if there were a node 
b′ between b and d, we will need the path c → b → b′ to be valid; (obvious) validity of the arrow c → b will not suffice. 

(3) If we allow ρ to be the empty path, then the case z = x is a subcase of the present one. 

(4) Our conceptual analysis has led to a very important simplification of the definition of validity. If we adopt on-path 
preclusion, we have to remember all paths which led to the information source to be considered: In the Tweety diagram, 
we have to remember that there is an arrow a → b, it is not sufficient to note that we somehow came from a to b by a 
valid path, as the path a → c → b → d is precluded, but not the path a → b → d. If we adopt total validity preclusion, see 
also Section 9.2.3 (page 181) for more discussion, we have to remember the valid path a → c → b to see that it precludes 
a → c → d. If we allow preclusion to "hit" below the last node, we also have to remember the entire path which is precluded. 
Thus, in all those cases, whole paths (which can be very long) have to be remembered, but NOT in our definition. 

We only need to remember (consider the Tweety diagram): 

(a) we want to know if a → b → d is valid, so we have to remember a, b, d. Note that the (valid) path from a to b can be 
composed and very long. 

(b) we look at possible preclusions, so we have to remember a → c ↛ d, again the (valid) path from a to c can be very 
long. 

(c) we have to remember that the path from c to b is valid (this was decided by induction before). 

So in all cases (the last one is even simpler), we need only remember the starting node, a (or c), the last node of the valid 
paths, b (or c), and the information b → d or c ↛ d - i.e. the size of what has to be recalled is ≤ 3. (Of course, there may 
be many possible preclusions, but in all cases we have to look at a very limited situation, and not arbitrarily long paths.) 

We take a fast look forward to Section 4.3, where we describe diagrams as information and its transfer, and nodes also as 
truth values. In these terms - and the reader is asked to excuse the digression - we may note above point (a) as a ⇒_b d - 
expressing that, seen from a, d holds with truth value b, (b) as a ⇒_c ¬d, (c) as c ⇒_c b - and this is all we need to know. 
□ 



We indicate here some modifications of the definition without discussion, which is to be found below. 

(1) For on-path preclusion only: Modify condition (2) in Case II.1 to: (2′) There is no v on the path σ (i.e. σ : x ⋯ → 
v ⋯ → u) such that v ↛ y ∈ Γ. 

(2) For total validity preclusion: Modify condition (2) in Case II.1 to: (2′) There are no v, τ, τ′ such that v ↛ y ∈ Γ and 
τ := x ⋯ → v and τ′ := v ⋯ → u such that Γ ⊨ τ ∘ τ′. 

(3) For extension based approaches: Modify condition (3) in Case II. 1 as follows: (3') If there are conflicting paths, which 
are not precluded themselves by contradictory information, then we branch recursively (i.e. for all such situations) into 
two extensions, one, where the positive non-precluded paths are valid, one, where the negative non-precluded paths are 
valid. 



Definition 9.2.5 

Finally, define Γ ⊨ xy iff there is σ : x ⋯ → y such that Γ ⊨ σ, likewise for xȳ and σ : x ⋯ ↛ y. 



Diagram 9.2.3 (page 179) shows the most complicated situation for the positive case. 

Diagram 9.2.3 The complicated case 

We have to show now that the above approach corresponds to the preceding discussion. 
Fact 9.2.2 

The above definition and the informal one outlined in Definition 9.2.2 (page 177) correspond, when we consider valid 
positive paths as access to information and comparison of information strength as indicated at the beginning of Section 
9.2.2 (page 176) and elaborated in Section 9.4.3 (page 186). 

Proof 

As Definition 9.2.2 (page 177) is informal, this cannot be a formal proof, but it is obvious how to transform it into one. 

We argue for the result, the argument for valid paths is similar. 

Consider then case (P2.2) in Definition 9.2.2 (page 177), and start from some x. 

9.2.2.0.6 Case 1: 

Direct links, x → z or x ↛ z. 

By comparison of strength via preclusion, as a direct link starts at x, the information z or ¬z is stronger than all other 
accessible information. Thus, the link and the information will be valid in both approaches. Note that we assumed Γ free 
from hard contradictions. 

9.2.2.0.7 Case 2: 

Composite paths. 

In both approaches, the initial segment has to be valid, as information will otherwise not be accessible. Also, in both 
approaches, information will have the form of direct links from the accessible source. Thus, condition (1) in Case II.1 
corresponds to condition (1) in Definition 9.2.2 (page 177). 

In both approaches, information contradicted by a stronger source (preclusion) is discarded, as well as information which 
is contradicted by other, not precluded sources, so (P2.2) in Definition 9.2.2 (page 177) and II.1 (2) + (3) correspond. Note 
that variant (P2.1) of Definition 9.2.2 (page 177) would give a different result - which we could, of course, also imitate in 
a modified inheritance approach. 

9.2.2.0.8 Case 3: 

Other information. 

Inheritance nets give no other information, as valid information is deduced only through valid paths by Definition 9.2.5 
(page 179). And we did not add any other information either in the approach in Definition 9.2.2 (page 177). But as is 
obvious in Case 2, valid paths coincide in both cases. 

Thus, both approaches are equivalent. 

□ 
9.2.3 Review of other approaches and problems 

We now discuss briefly, but in more detail, some of the differences between various major definitions of inheritance formalisms. 

Diagram 6.8, p. 179, in [Sch97-2] (which is probably due to folklore of the field) shows that requiring downward chaining would 
be wrong. We repeat it here, see Diagram 9.2.4 (page 181). 



Diagram 9.2.4 The problem of downward chaining 



Preclusions valid above (here at u) can be invalid at lower points (here at z), as part of the relevant information is no 
longer accessible (or becomes accessible). We have u → x ↛ y valid; by downward chaining, any valid path z → u … y 
has to have a valid final segment u … y, which can only be u → x ↛ y, but intuition says that z → u → v → y should be 
valid. Downward chaining prevents such changes, and thus seems inadequate, so we decide for upward chaining. (Already 
preclusion itself underlines upward chaining: In the Tweety diagram, we have to know that the path from bottom up 
to penguins is valid. So at least some initial subpaths have to be known - we need upward chaining.) (The rejection of 
downward chaining seems at first sight to be contrary to the intuitions carried by the word "inheritance".) See also the 
remarks in Section 9.4.4 (page 188). 

9.2.3.0.9 Extension-based versus directly skeptical definitions 

As this distinction has already received detailed discussion in the literature, we shall be very brief here. An extension 
of a net is essentially a maximally consistent and in some appropriate sense reasonable subset of all its potential paths. 
This can of course be presented either as a liberal conception (focussing on individual extensions) or as a skeptical one 
(focussing on their intersection - or, the intersection of their conclusion sets). The seminal presentation is that of [Tou86], 
as refined by [San86]. The directly skeptical approach seeks to obtain a notion of skeptically accepted path and conclusion, 
but without detouring through extensions. Its classic presentation is that of [HTT87]. Even while still searching for fully 
adequate definitions of either kind, we may use the former approach as a useful "control" on the latter. For if we can 
find an intuitively possible and reasonable extension supporting a conclusion xȳ, whilst a proposed definition for a directly 
skeptical notion of legitimate inference yields xy as a conclusion, then the counterexemplary extension seems to call into 
question the adequacy of the directly skeptical construction, more readily than inversely. It has been shown in [Sch93] 
that the intersection of extensions is fundamentally different from the directly sceptical approach. See also the remark in 
Section 9.4.4 (page 188). 

From now on, all definitions considered shall be (at least) upward chaining. 
9.2.3.0.10 On-path versus off-path preclusion 

This is a rather technical distinction, discussed in [THT87]. Briefly, a path σ : x → … → y → … → z and a direct link 
y ↛ u is an off-path preclusion of τ : x → … → z → … → u, but an on-path preclusion only iff all nodes of τ between x 
and z lie on the path σ. 

For instance, in the Tweety diagram, the arrow c ↛ d is an on-path preclusion of the path a → c → b → d, but the paths 
a → c and c → b, together with c ↛ d, are a (split validity) off-path preclusion of the path a → b → d. 
Consider again a preclusion σ : u → … → x → … → v, and x ↛ y of τ : u → … → v → … → y. Most definitions demand 
for the preclusion to be effective - i.e. to prevent τ from being accepted - that the total path σ is valid. Some ([GV89], 
[KK89], [KKW89a], [KKW89b]) content themselves with the combinatorially simpler separate (split) validity of the lower 
and upper parts of σ: σ′ : u → … → x and σ″ : x → … → v. In Diagram 9.2.5 (page 182), taken from [Sch97-2], the path 
x → w → v is valid, so is u → x, but not the whole preclusion path u → x → w → v. 



Diagram 9.2.5 Split vs. total validity preclusion 

Thus, split validity preclusion will give here the definite result uȳ. With total validity preclusion, the diagram has essentially 
the form of a Nixon Diamond. 

9.3 Defeasible inheritance and reactive diagrams 

Before we discuss the relationship in detail, we first summarize our algorithm. 

9.3.1 Summary of our algorithm 

We look for valid paths from x to y. 

(1) Direct arrows are valid paths. 

(2) Consider the set C of all direct predecessors of y, i.e. all c such that there is a direct link from c to y. 

(2.1) Eliminate all those to which there is no valid positive path from x (found by induction), let the new set be C′ ⊆ C. 
If the existence of a valid path has not yet been decided, we have to wait. 

(2.2) Eliminate from C′ all c such that there is d ∈ C′ and a valid positive path from d to c (found by induction) - unless 
the arrows from c and from d to y are of the same type. Let the new set be C″ ⊆ C′ (this handles preclusion). 

If the existence of such valid paths has not yet been decided, we have to wait. 

(2.3) If the arrows from all elements of C″ to y have the same polarity, we have a valid path from x to y, if not, we have an 
unresolved contradiction, and there is no such path. 

Note that we were a bit sloppy here. It can be debated whether preclusion by some d such that c and d have the same 
type of arrow to y should be accepted. As we are basically only interested whether there is a valid path, but not in its 
details, this does not matter. 
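
Steps (1) to (2.3) can be sketched as a recursive procedure. This is an illustration under assumptions of ours, not a definitive implementation: the names `make_validity` and `valid` are hypothetical, a net is encoded as a dictionary from (source, target) pairs to polarity, and the "waiting" of the text is replaced by memoized recursion, which terminates because the net is acyclic and every recursive call targets a direct predecessor of the current target.

```python
from functools import lru_cache
from typing import Callable, Dict, List, Optional, Tuple

# (source, target) -> True for a positive arrow, False for a negative one.
Net = Dict[Tuple[str, str], bool]


def make_validity(net: Net) -> Callable[[str, str], Optional[bool]]:
    """Build valid(x, y): True / False for a valid positive / negative path, else None."""
    preds: Dict[str, List[str]] = {}
    for (c, y) in net:
        preds.setdefault(y, []).append(c)

    @lru_cache(maxsize=None)
    def valid(x: str, y: str) -> Optional[bool]:
        # (1) Direct arrows are valid paths.
        if (x, y) in net:
            return net[(x, y)]
        # (2.1) Keep predecessors of y reachable from x by a valid positive path.
        c1 = [c for c in preds.get(y, []) if valid(x, c) is True]
        # (2.2) Preclusion: drop c if some d in c1 has a valid positive path to c
        #       and the arrows from d and from c to y have different polarity.
        c2 = [
            c
            for c in c1
            if not any(
                d != c and valid(d, c) is True and net[(d, y)] != net[(c, y)]
                for d in c1
            )
        ]
        # (2.3) Agreement among the survivors decides; otherwise no valid path.
        polarities = {net[(c, y)] for c in c2}
        return polarities.pop() if len(polarities) == 1 else None

    return valid


# Tweety diagram: a -> b, a -> c, c -> b, b -> d, and c -/-> d.
tweety: Net = {("a", "b"): True, ("a", "c"): True, ("c", "b"): True,
               ("b", "d"): True, ("c", "d"): False}
# Nixon Diamond: n -> q, n -> r, q -> p, r -/-> p.
nixon: Net = {("n", "q"): True, ("n", "r"): True,
              ("q", "p"): True, ("r", "p"): False}

valid_tweety = make_validity(tweety)
valid_nixon = make_validity(nixon)
print(valid_tweety("a", "d"), valid_nixon("n", "p"))
```

In the Tweety diagram the predecessor b of d is precluded by c, so only the negative arrow from c survives; in the Nixon Diamond neither predecessor precludes the other, and the contradiction stays unresolved.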



9.3.2 Overview 



There are several ways to use reactive graphs to help us solve inheritance diagrams - but also to go beyond them. 
(2) We go deeper into the calculating mechanism, and do not only use new arrows as memory, but also for calculation. 

(3) We can put up "sign posts" to mark dead ends, in the following sense: If we memorize valid paths from x to y, then, 
anytime we are at a branching point u coming from x, trying to go to y, and there is no valid path through an arrow 
leaving u, we can put up a sign post saying "no valid path from x to y through this arrow". 

Note that we have to state destination y (of course), but also outset, x : There might be a valid path from u to y, 
which may be precluded or contradicted by some path coming from x. 

(4) We can remember preclusion, in the following sense: If we found a valid positive path from a to b, and there are 
contradicting arrows from a and b to c, then we can create an arrow from a to the arrow from b to c. So, if, from x, 
we can reach both a and b, the arrow from b to c will be incapacitated. 

Before we discuss the first three possibilities in detail, we briefly discuss the more general picture (in rough outline). 

(1) Replacing labels by arrows and vice versa. 

As we can switch arrows on and off, an arrow carries a binary value - even without any labels. So the idea is obvious: 
If an arrow has 1 label with n possible values, we can replace it with n parallel arrows (i.e. same source and 
destination), where we switch exactly one of them on - this is the label. 

Conversely, we can replace n parallel arrows without labels, where exactly one is active, by one arrow with n labels. 
We can also code labels of a node x by an arrow a : x → x, which has the same labels. 

(2) Coding logical formulas and truth in a model 

We take two arrows for each propositional variable, one stands for true, the other for false. Negation blocks the 
positive arrow, enables the negative one. Conjunction is solved by concatenation, disjunction by parallel paths. If a 
variable occurs more than once, we make copies, which are "switched" by the "master arrow" . 
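
The exchange between labels and parallel arrows described in (1) amounts to a one-hot coding. The following lines are a minimal sketch; the function names `label_to_arrows` and `arrows_to_label` are hypothetical and chosen only for this illustration, and an "active" arrow is represented by the Boolean value True.

```python
from typing import List, Sequence


def label_to_arrows(value: str, values: Sequence[str]) -> List[bool]:
    """Replace one labelled arrow by len(values) parallel arrows, exactly one active."""
    if value not in values:
        raise ValueError(f"unknown label value: {value!r}")
    return [v == value for v in values]


def arrows_to_label(active: Sequence[bool], values: Sequence[str]) -> str:
    """Replace parallel arrows, exactly one of which is active, by one label value."""
    if len(active) != len(values) or sum(active) != 1:
        raise ValueError("exactly one parallel arrow must be active")
    return values[list(active).index(True)]


# The six values used later for coding paths (ASCII spelling is our choice).
values = ["*", "p+", "p-", "p+-", "v+", "v-"]
arrows = label_to_arrows("v+", values)
label = arrows_to_label(arrows, values)
print(arrows, label)
```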

We come back to the first three ways to treat inheritance by reactive graphs, and also mention a way to go beyond usual 
inheritance. 



9.3.3 Compilation and memorization 

When we take a look at the algorithm deciding which potential paths are valid, we see that, with one exception, we only 
need the results already obtained, i.e. whether there is a valid positive/negative path from a to b, and not the actual paths 
themselves. (This is, of course, due to our split validity definition of preclusion.) The exception is that preclusion works 
"in the upper part" with direct links. But this is a local problem: we only have to look at the direct predecessors of a 
node. 

Consequently, we can do most of the calculation just once, in an induction of increasing "path distance", memorize valid 
positive (the negative ones cannot be used) paths with special arrows which we activate once their validity is established, 
and work now as follows with the new arrows: 

Suppose we want to know if there is a valid path from x to y. We look backwards at all predecessors b of y (using a simple 
backward pointer), and look whether there is a valid positive path from x to b, using the new arrows. We then look at all 
arrows going from such b to y. If they all agree (i.e. all are positive, or all are negative), we need not look further, and have 
the result. If they disagree, we have to look at possible comparisons by specificity. For this, we see whether there are new 
arrows between the b's. All such b to which there is a new arrow from another b are out of consideration. If the remaining 
agree, we have a valid path (and activate a new arrow from x to y if the path is positive), if not, there is no such path. 
(Depending on the technical details of the induction, it can be useful to note this also by activating a special arrow.) 

9.3.4 Executing the algorithm 

Consider any two points x, y. 

There can be no path, a positive potential path, a negative potential path, both potential paths, a valid positive path, a 
valid negative path (from x to y). 

Once a valid path is found, we can forget potential paths, so we can code the possibilities by {*, p+, p−, p+−, v+, v−}, in 
above order. 

We saw that we can work either with labels, or with a multitude of parallel arrows, we choose the first possibility. 
We create for each pair of nodes a new arrow, (x, y), which we initialize with label *. 
First, we look for potential paths. 

If there is a direct link from x to y, we change the value * of (x, y) directly to v+ or v−. 

If (x, y′) has value p+ or p+− or v+, and there is a direct link from y′ to y, we change the value of (x, y) from * to p+ 
or p−, depending on the link from y′ to y, from p+ or p− to p+− if adequate (we found both possibilities), and leave the 
value unchanged otherwise. 

This determines all potential paths. 
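
The labelling stage just described can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the arrow-based construction itself: the function name `potential_labels`, the dictionary encoding of direct links, and the fixpoint loop are all hypothetical choices made for the example. The label strings follow the coding {*, p+, p-, p+-, v+, v-} introduced above.

```python
def potential_labels(nodes, links):
    """Compute the labels of the potential-path stage (illustrative sketch).

    nodes: iterable of node names.
    links: dict mapping (source, target) to '+' or '-' for each direct link.
    Returns a dict mapping each ordered pair (x, y), x != y, to one of
    '*', 'p+', 'p-', 'p+-', 'v+', 'v-'.
    """
    nodes = list(nodes)
    # Every pair starts with label '*', i.e. "no path known".
    label = {(x, y): '*' for x in nodes for y in nodes if x != y}
    # Direct links are set to v+ or v- immediately.
    for (x, y), sign in links.items():
        label[(x, y)] = 'v' + sign
    # Propagate until nothing changes (labels only move * -> p+/p- -> p+-).
    changed = True
    while changed:
        changed = False
        for x in nodes:
            for (mid, y), sign in links.items():
                if x == mid or x == y:
                    continue
                # We need a positive (potential or direct) path from x to mid.
                if label[(x, mid)] not in ('p+', 'p+-', 'v+'):
                    continue
                old = label[(x, y)]
                if old == '*':
                    new = 'p' + sign
                elif old in ('p+', 'p-') and old != 'p' + sign:
                    new = 'p+-'  # both possibilities were found
                else:
                    new = old    # unchanged otherwise, in particular v+ and v-
                if new != old:
                    label[(x, y)] = new
                    changed = True
    return label


# Hypothetical encoding of the Tweety situation.
tweety_nodes = ['tweety', 'penguin', 'bird', 'fly']
tweety_links = {
    ('tweety', 'penguin'): '+',
    ('penguin', 'bird'): '+',
    ('bird', 'fly'): '+',
    ('penguin', 'fly'): '-',
}
tweety_labels = potential_labels(tweety_nodes, tweety_links)
```

In this encoding the pair (tweety, fly) ends with both a positive and a negative potential path, which is exactly the situation the validity stage below has to resolve.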

We make for each pair x, y at y a list of its predecessors, i.e. of all c s.t. there is a direct link from c to y. We do this for 
all x, so we can work in parallel. A list is, of course, a concatenation of suitable arrows. 

Suppose we want to know if there is a valid path from x to y. 

First, there might be a direct link, and we are done. 
If one c in the list has value *, p-, v- for (x, c), it is eliminated from the list. If one c has p+ or p+-, we have to wait. 
We do this until all (x, c) have *, v+ or v-, so those remaining in the list will have v+. 

We look at all pairs in the list. While at least one (c, c') has p+ or p+-, we have to wait. Finally, all pairs will have *, 
p-, v+ or v-. Eliminate all c' s.t. there is (c, c') with value v+ - unless the arrows from c and c' to y are of the same 
type. 

Finally, we look at the list of the remaining predecessors; if they all have the same link to y, we set (x, y) to v+ or v-, 
otherwise to *. 

All such operations can be done by suitable operations with arrows, but it is very lengthy and tedious to write down the 
details. 



9.3.5 Signposts 

Putting up signposts requires memorizing all valid paths, as leaving one valid path does not necessarily mean that there 
is no alternative valid path. The easiest way to do this is probably to put up a warning post everywhere, and collect the 
wrong ones going backwards through the valid paths. 

We illustrate this with Diagram 9.3.1 (page 184): 

[Diagram 9.3.1] 

There are the following potential paths from x to y : xcy, xceb-y, xcebday, xay. 

The paths xc, xa, and xceb are valid. The latter: xce is valid, xceb is in competition with xcg-b and xceg-b, but both are 
precluded by the arrow (valid path) eg. 

y's predecessors on those paths are a, b, c. 
b and c are comparable, as the path ceb is valid, since cg-b and ceg-b are precluded by the valid path (arrow) eg. So c is 
more specific than b. 

Thus, b is out of consideration, and we are left with a and c. They agree, so there is a positive valid path from x to y, more 
precisely, one through a, one through c. 

We have put STOP signs on the arrows ce and eg, as we cannot continue via them to y. 

9.3.6 Beyond inheritance 

But we can also go beyond usual inheritance networks. 
Consider the following scenario: 

• Museum airplanes usually will not fly, but usual airplanes will fly. 

• Penguins don't fly, but birds do. 

• Non-flying birds usually have fat wings. 

• Non-flying aircraft usually are rusty. 

• But it is not true that usually non-flying things are rusty and have fat wings. 
We can model this with higher order arrows as follows: 

• Penguins -> birds, museum airplanes -> airplanes, birds -> fly, airplanes -> fly. 

• Penguins -/-> fly, museum airplanes -/-> fly. 

• Flying objects -/-> rusty, flying objects -/-> have fat wings. 

• We allow concatenation of two negative arrows: 

Coming e.g. from penguins, we want to concatenate penguins -/-> fly and fly -/-> fat wings. Coming from museum 
aircraft, we want to concatenate museum aircraft -/-> fly and fly -/-> rusty. 

We can enable this as follows: we introduce a new arrow α : (penguin -/-> fly) -> (fly -/-> fat wings), which, when 
traversing penguin -/-> fly, enables the algorithm to concatenate with the arrow it points to, using the rule "- * - = +", 
giving the result that penguins usually have fat wings. 

See [Gab08c] for deeper discussion. 

9.4 Interpretations 

9.4.1 Introduction 

We will discuss in this Section three interpretations of inheritance nets. 

First, we will indicate fundamental differences between inheritance and the systems P and R. They will be elaborated in 
Section 9.5 (page 190), where an interpretation in terms of small sets will be tried nonetheless, and its limitations explored. 

Second, we will interpret inheritance nets as systems of information and information flow. 

Third, we will interpret inheritance nets as systems of prototypes. 

Inheritance nets present many intuitively attractive properties, thus it is not surprising that we can interpret them in 
several ways. Similarly, preferential structures can be used as a semantics of deontic and of nonmonotonic logic: they 
express a common idea, choosing a subset of models by a binary relation. Thus, such an ambiguity need not be a sign of 
a basic flaw. 

9.4.2 Informal comparison of inheritance with the systems P and R 
9.4.2.0.12 The main issues 

In the authors' opinion, the following two properties of inheritance diagrams show the deepest difference to preferential 
and similar semantics, and the first even to classical logic. They have to be taken seriously, as they are at the core of 
inheritance systems, are independent of the particular formalism, and show that there is a fundamental difference between 
the former and the latter. Consequently, any attempt at translation will have to stretch one or both sides perhaps beyond 
the breaking point. 

(1) Relevance, 

(2) subideal situations, or relative normality 

Both (and more) can be illustrated by the following simple Diagram 9.4.1 (page 186) (which also shows conflict resolution 
by specificity). 

(1) Relevance: As there is no monotonous path whatever between e and d, the question whether e's are d's or not, or vice 
versa, does not even arise. For the same reason, there is no question whether b's are c's, or not. (As a matter of fact, we 
Of course, in classical logic, all information is relevant to the rest, so we can say e.g. that e's are d's, or e's are non-d's, 
or some are d's, some are not, but there is a connection. As preferential models are based on classical logic, the same 
argument applies to them. 

(2) In our diagram, a's are b's, but not ideal b's, as they are not d's, the more specific information from c wins. But they 
are e's, as ideal b's are. So they are not perfectly ideal b's, but as ideal b's as possible. Thus, we have graded ideality, 
which does not exist in preferential and similar structures. In those structures, if an element is an ideal element, it has all 
properties of such, if one such property is lacking, it is not ideal, and we can't say anything any more. Here, however, we 
sacrifice as little normality as possible, it is thus a minimal change formalism. 

In comparison, questions of information transfer and strength of information seem lesser differences. Already systems P 
and R (see Definition 2.3) differ on information transfer. In both cases, transfer is based on the same notion of 
smallness, which describes ideal situations. But, as said in Remark 3.2.1, this is conceptually very different from 
the use of smallness, describing normal situations. Thus, it can be considered also on this level an independent question, and 
we can imagine systems based on absolutely ideal situations for normality, but with a totally different transfer mechanism. 

Diagram 9.4.1 Information transfer 



For these reasons, extending preferential and related semantics to cover inheritance nets seems to stretch them to the 
breaking point. Thus, we should also look for other interpretations. (The term "interpretation" is used here in a 
non-technical sense.) In particular, it seems worthwhile to connect inheritance systems to other problems, and see whether 
there are similarities there. This is what we do now. We come back to the systems P and R in Section 9.5 (page 190). 

Note that Reiter Defaults behave much more like inheritance nets than like preferential logics. 
9.4.3 Inheritance as information transfer 

An informal argument showing parallel ideas common to inheritance with an upward chaining formalism and information 
transfer is as follows: First, arrows represent certainly some kind of information, of the kind "most a's are b's" or so. (See 
Diagram 9.4.1 (page 186).) Second, to be able to use information, e.g. "d's are f's" at a, we have to be able to connect 
from a to d by a valid path, this information has to be made accessible to a, or, in other terms, a working information 
channel from a to d has to be established. Third, specificity (when present) decides conflicts (we take the split validity 
approach). This can be done procedurally, or, perhaps simpler and certainly in a more transparent way, by assigning a 
comparison of information strength to valid paths. Now, information strength may also be called truth value (to use a 
term familiar in logic) and the natural entity at hand is the node itself - this is just a cheap formal trick without any 
conceptual meaning. 

When we adopt this view, nodes, arrows, and valid paths have multiple functions, and it may seem that we overload the 
(deceptively) simple picture. But it is perhaps the charm and the utility and naturalness of inheritance systems that they 
are not "clean", and hide many complications under a simple surface, as human common sense reasoning often does, too. 
opinion. Moreover, our analysis makes a clear distinction between arrows and composite valid paths. This distinction is 
implicit in inheritance formalisms, we make it explicit through our concepts. 

But this interpretation is by no means the only one, and can only be suggested as a possibility. 

We will now first give the details, and then discuss our interpretation. 



9.4.3.0.13 (1) Information: 

Direct positive or negative arrows represent information, valid for their source. Thus, in a set reading, if there is an arrow 
A -> B in the diagram, most elements of A will be in B, in short: "most A's are B's" - and A -/-> B will mean that most 
A's are not B's. 



9.4.3.0.14 (2) Information sources and flow: 

Nodes are information sources. If A -> B is in the diagram, A is the source of the information "most A's are B's". 

A valid, composed or atomic positive path σ from U to A makes the information of source A accessible to U. One can also 
say that A's information becomes relevant to U. Otherwise, information is considered independent - only (valid) positive 
paths create the dependencies. 

(If we want to conform to inheritance, we must not add trivialities like "x's are x's", as this would require x -> x in the 
corresponding net, which, of course, will not be there in an acyclic net.) 

9.4.3.0.15 (3) Information strength: 

A valid, composed or atomic positive path σ from A' to A allows us to compare the strength of information source A' with 
that of A : A' is stronger than A. (In the set reading, this comparison is the result of specificity: more specific information 
is considered more reliable.) If there is no such valid path, we cannot resolve contradictions between information from 
A and A'. This interpretation results in split validity preclusion: the comparison between information sources A' and A 
is absolute, and does NOT depend on the U from which both may be accessible - as can be the case with total validity 
preclusion. Of course, if desired, we can also adopt the much more complicated idea of relative comparison. 

Nodes are also truth values. They are the strength of the information whose source they are. This might seem an abuse 
of nodes, but we already have them, so why not use them? 

9.4.3.0.16 IS-Discussion: 

Considering direct arrows as information meets probably with little objection. 

The conclusion of a valid path (e.g. if σ : a ... -> b is valid, then its conclusion is "a's are b's") is certainly also information, 
but it has a status different from the information of a direct link, so we should distinguish it clearly. At least in upward 
chaining formalisms, using the path itself as some channel through which information flows, and not the conclusion, seems 
more natural. The conclusion says little about the inner structure of the path, which is very important in inheritance 
networks, e.g. for preclusion. When calculating validity of paths, we look at (sub- and other) paths, but not just their 
results, and should also express this clearly. 

Once we accept this picture of valid positive paths as information channels, it is natural to see their upper ends as 
information sources. 

Our interpretation supports upward chaining, and vice versa, upward chaining supports our interpretation. 

One of the central ideas of inheritance is preclusion, which, in the case of split validity preclusion, works by an absolute 
comparison between nodes. Thus, if we accept split validity preclusion, it is natural to see valid positive paths as 
comparisons between information of different strengths. Conversely, if we accept absolute comparison of information, we should 
also accept split validity preclusion - these interpretations support each other. 

Whatever type of preclusion we accept, preclusion clearly compares information strength, and allows us to decide for 
the stronger one. We can see this procedurally, or by giving different values to different information, depending on their 
sources, which we can call truth values to connect our picture to other areas of logic. It is then natural - as we have it 
already - to use the source node itself as truth value, with comparison via valid positive paths. 



9.4.3.0.17 Illustration: 

Thus, in a given node U, information from A is accessible iff there is a valid positive path from U to A, and if information 
from A' is also accessible, and there is a valid positive path from A' to A, then, in case of conflict, information from A' 
wins over that from A, as A' has a better truth value. In the Tweety diagram, see Diagram 9.2.1 (page 174), Tweety has 
access to penguins and birds, the horizontal link from penguin to bird compares the strengths, and the fly/not fly arrows 
are the information we are interested in. 

Note that negative links and (valid) paths have much less function in our picture than positive links and valid paths. In 
a way, this asymmetry is not surprising, as there are no negative nodes (which would correspond to something like the 
set complement or negation). To summarize: A negative direct link can only be information. A positive direct link is 
information at its source, but it can also be a comparison of truth values, or it can give access from its source to information 
at its end. A valid positive, composed path can only be comparison of truth values, or give access to information, it is 
NOT information itself in the sense of direct links. This distinction is very important, and corresponds to the different 
treatment of direct arrows and valid paths in inheritance, as it appears e.g. in the definition of preclusion. A valid negative 
composed path has no function, only its parts have. 



We obtain automatically that direct information is stronger than any other information: If A has information φ, and there [...] 

Inheritance diagrams in this interpretation represent not only reasoning with many truth values, but also reasoning about 
those truth values: their comparison is done by the same underlying mechanism. 

We should perhaps note in this context a connection to an area currently en vogue: the problem of trust, especially in the 
context of web information. We can see our truth values as the degrees of trust we put into information coming from this 
node. And, we not only use, but also reason about them. 



9.4.3.0.18 Further comments: 

Our reading also covers enriched diagrams, where arbitrary information can be "appended" to a node. 

An alternative way to see a source of information is to see it as a reason to believe the information it gives. U needs a 
reason to believe something, i.e. a valid path from U to the source of the information, and also a reason to disbelieve, i.e. 
if U' is below U, and U believes and U' does NOT believe some information of A, then either U' has stronger information 
to the contrary, or there is not a valid path to A any more (and neither to any other possible source of this information). 
("Reason" , a concept very important in this context, was introduced by A.Bochman into the discussion.) 

The restriction that negative links can only be information applies to traditional inheritance networks, and the authors 
make no claim whatever that it should also hold for modified such systems, or in still other contexts. One of the reasons 
why we do not have "negative nodes" , and thus negated arrows also in the middle of paths might be the following (with 
C complementation): If, for some X, we also have a node for CX, then we should have X -/-> CX and CX -/-> X, thus a 
cycle, and arrows from Y to X should be accompanied by their opposite to CX, etc. 

We translate the analysis and decision of Definition 9.2.2 (page 177) now into the picture of information sources, accessibility, 
and comparison via valid paths. This is straightforward: 

(1) We have that information from A_i, i ∈ I, about B is accessible from U, i.e. there are valid positive paths from U to all 
A_i. Some A_i may say ¬B, some B. 

(2) If information from A_i is comparable with information from A_j (i.e. there is a valid positive path from A_i to A_j or the 
other way around), and A_i contradicts A_j with respect to B, then the weaker information is discarded. 

(3) There remains a (nonempty, by lack of cycles) set of the A_i, such that for no such A_i there is A_j with better contradictory 
information about B. If the information from this remaining set is contradictory, we accept none (and none of the paths 
either); if not, we accept the common conclusion and all these paths. 
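
Steps (1) to (3) can be sketched as a small Python function. This is a minimal illustration under stated assumptions: the name `decide`, the encoding of sources as a dictionary, and the set `stronger` of pairs (standing for "there is a valid positive path from the first source to the second") are all hypothetical. We also assume that a source is discarded as soon as any accessible source with better, contradictory information exists.

```python
def decide(sources, stronger):
    """Resolve accessible information about B (illustrative sketch).

    sources:  dict mapping each accessible source to True (it says B)
              or False (it says not-B).
    stronger: set of pairs (s, t) meaning there is a valid positive path
              from s to t, so the information of s is stronger than that of t.
    Returns True (B accepted), False (not-B accepted), or None (nothing accepted).
    """
    # Steps (2) and (3): drop every source that is contradicted by a stronger one.
    remaining = {
        s for s in sources
        if not any((t, s) in stronger and sources[t] != sources[s] for t in sources)
    }
    # Accept the common conclusion if the remaining sources agree, else nothing.
    verdicts = {sources[s] for s in remaining}
    return verdicts.pop() if len(verdicts) == 1 else None


# Tweety: penguin (not fly) is more specific than bird (fly).
tweety = decide({'penguin': False, 'bird': True}, {('penguin', 'bird')})

# Nixon Diamond: quaker (pacifist) and republican (not pacifist) are incomparable.
nixon = decide({'quaker': True, 'republican': False}, set())
```

In this sketch the Tweety case yields the negative conclusion, and the Nixon Diamond yields no conclusion at all, in line with the directly sceptical reading.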

We continue now Remark 9.2.1 (page 179), (4), and turn this into a formal system. 

Fix a diagram Γ, and do an induction as in Definition 9.2.2 (page 177). 

Definition 9.4.1 

(1) We distinguish a => b and a =>_x b, where the intuition of a =>_x b is: we know with strength x that a's are b's, and of 
a => b that it has been decided, taking all information into consideration, that a => b holds. 

(We introduce this notation to point informally to our idea of information strength, and beyond, to logical systems with 
varying strengths of implications.) 

(2) a -> b implies a =>_a b, likewise a -/-> b implies a =>_a ¬b. 

(3) a =>_a b implies a => b, likewise a =>_a ¬b implies a => ¬b. This expresses the fact that direct arrows are uncontested. 

(4) a => b and b =>_b c imply a =>_b c, likewise for b =>_b ¬c. This expresses concatenation - but without deciding if it is 
accepted! Note that we cannot make (a => b and b => c imply a =>_b c) a rule, as this would make concatenation of two 
composed paths possible. 

(5) We decide acceptance of composed paths as in Definition 9.2.3, where preclusion uses accepted paths for 
deciding. 

Note that we reason in this system not only with, but also about relative strength of truth values, which are just nodes, 
this is then, of course, used in the acceptance condition, in preclusion more precisely. 



9.4.4 Inheritance as reasoning with prototypes 

Some of the issues we discuss here apply also to the more general picture of information and its transfer. We present them 
here for motivational reasons: it seems easier to discuss them in the (somewhat!) more concrete setting of prototypes than 
in the very general situation of information handling. These issues will be indicated. 

It seems natural to see information in inheritance networks as information about prototypes. (We do not claim that our use 
of the word "prototype" has more than a vague relation to the use in psychology. We do not try to explain the usefulness 
of prototypes either, one possibility is that there are reasons why birds fly, and why penguins don't, etc.) In the Tweety 
diagram, we will thus say that prototypical birds will fly, prototypical penguins will not fly. More precisely, the property 
"fly" is part of the bird prototype, the property "¬fly" part of the penguin prototype. Thus, the information is given for 
some node, which defines its application or domain (bird or penguin in our example) - beyond this node, the property is 
not defined (unless inherited, of course). It might very well be that no element of the domain has ALL the properties of 
the prototype, every bird may be exceptional in some sense. This again shows that we are very far from the ideal picture 
of small and big subsets as used in systems P and R. (This, of course, goes beyond the problem of prototypes.) 

Of course, we will want to "inherit" properties of prototypes, for instance in Diagram 9.4.1 (page 186), a "should" inherit 
the property e from b, and the property ->d from c. Informally, we will argue as follows: "Prototypical a's have property b, 
and prototypical b's have property e, so it seems reasonable to assume that prototypical a's also have property e - unless 
there is better information to the contrary." A plausible solution is then to use upward chaining inheritance as described 
above to find all relevant information, and then compose the prototype. 

(1) Using upward chaining has an additional intuitive appeal: We consider information at a the best, so we begin with b 
(and c), and only then, tentatively, add information e from b. Thus, we begin with strongest information, and add weaker 
information successively - this seems good reasoning policy. 

(2) In upward chaining, we also collect information at the source (the end of the path), and do not use information which 
was already filtered by going down - thus the information we collect has no history, and we cannot encounter problems of 
iterated revision, which are problems of history of change. (In downward chaining, we only store the reasons why something 
holds, but not why something does not hold, so we cannot erase this negative information when the reason is not valid any 
more. This is an asymmetry apparently not much noted before. Consider Diagram 9.2.4. Here, the reason why 
u does not accept y as information, but ->y, is the preclusion via x. But from z, this preclusion is not valid any more, so 
the reason why y was rejected is not valid any more, and y can now be accepted.) 

(3) We come back to the question of extensions vs. direct scepticism. Consider the Nixon Diamond, Diagram 9.2.2 (page 
175). Suppose Nixon were a subclass of Republican and Quaker. Then the extensions approach reasons as follows: Either 
the Nixon class prototype has the pacifist property, or the hawk property, and we consider these two possibilities. But this 
is not sufficient: The Nixon class prototype might have neither property - they are normally neither pacifists, nor hawks, 
but some are this, some are that. So the conceptual basis for the extensions approach does not hold: "Tertium non datur" 
just simply does not hold - as in Intuitionist Logic, where we may have neither a proof for φ, nor for ¬φ. 

Once we fixed this decision, i.e. how to find the relevant information, we can still look upward or downward in the net and 
investigate the changes between the prototypes in going upward or downward, as follows: E.g., in above example, we can 
look at the node a and its prototype, and then at the change going from a to b, or, conversely, look at b and its prototype, 
and then at the change going from b to a. The problem of finding the information, and this dynamics of information change 
have to be clearly separated. 

In both cases, we see the following: 

(1) The language is kept small, and thus efficient. 

For instance, when we go from a to b, information about c is lost, and "c" does not figure any more in the language, but 
f is added. When we go from b to a, f is lost, and c is won. In our simple picture, information is independent, and 
contradictions are always between two bits of information. 

(2) Changes are kept small, and need a reason to be effective. Contradictory, stronger information will override the old 
one, but no other information, except in the following case: making new information (in-) accessible will cause indirect 
changes, i.e. information now made (in-) accessible via the new node. This is similar to formalisms of causation: if a reason 
is not there any more, its effects vanish, too. 

It is perhaps more natural when going downward also to consider "subsets", as follows: Consider Diagram 9.4.1 (page 186). 
b's are d's, and c's are ¬d's, and c's are also b's. So it seems plausible to go beyond the language of inheritance nets, and 
conclude that b's which are not c's will be d's, in short to consider (b - c)'s. It is obvious which such subsets to consider, 
and how to handle them: For instance, loosely speaking, in b ∩ d, e will hold; in b ∩ c ∩ d, ¬f will hold; in b ∩ d ∩ Cc, f will 
hold, etc. This is just putting the bits of information together. 

We turn to another consideration, which will also transcend the prototype situation and we will (partly) use the intuition 
that nodes stand for sets, and arrows for (soft, i.e. possibly with exceptions) inclusion in a set or its complement. 

In this reading, specificity stands for soft, i.e. possibly with exceptions, set inclusion. So, if b and c are visible from a, and 
there is a valid path from c to b (as in Diagram 9.4.1 (page 186)), then a is a subset both of b and c, and c a subset of b, 
so a ⊆ c ⊆ b (softly). But then a is closer to c than a is to b. Automatically, a will be closest to itself. This results in a 
partial, and not necessarily transitive relation between these distances. 

When we go now from b to c, we lose information d and f, win information ¬d, but keep information e. Thus, this is 
minimal change: we give up (and win) only the necessary information, but keep the rest. As our language is very simple, 
we can use the Hamming distance between formula sets here. (We will make a remark on more general situations just 
below.) 

When we look now again from a, we take the set-closest class (c), and use the information of c, which was won by minimal 
change (i.e. the Hamming closest) from information of b. So we have the interplay of two distances, where the set distance 
certainly is not symmetrical, as we need valid paths for access and comparison. If there is no such valid path, it is reasonable 
to make the distance infinite. 

We make now the promised remark on more general situations: in richer languages, we cannot count formulas to determine 
the Hamming distance between two situations (i.e. models or model sets), but have to take the difference in propositional 
variables. Consider e.g. the language with two variables, p and q. The models (described by) p ∧ q and p ∧ ¬q have distance 
1, whereas p ∧ q and ¬p ∧ ¬q have distance 2. Note that this distance is NOT robust under re-definition of the language. 
Let p' stand for (p ∧ q) ∨ (¬p ∧ ¬q), and q' for q. Of course, p' and q' are equivalent descriptions of the same model set, as 
we can define all the old singletons also in the new language. Then the situations p ∧ q and ¬p ∧ ¬q have now distance 1, 
as one corresponds to p' ∧ q', the other to p' ∧ ¬q'. 
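
The two-variable example can be checked mechanically. The following Python sketch is only an illustration; the helper names `hamming` and `redescribe` and the dictionary encoding of models are assumptions made for the example. It computes the Hamming distance in the old language (p, q) and in the re-defined language (p', q').

```python
from itertools import product


def hamming(m1, m2):
    """Number of propositional variables on which two models differ."""
    return sum(m1[v] != m2[v] for v in m1)


def redescribe(m):
    """Translate a model over p, q into the language p', q',
    with p' := (p and q) or (not p and not q), and q' := q."""
    return {"p'": m['p'] == m['q'], "q'": m['q']}


pq = {'p': True, 'q': True}          # p and q
p_nq = {'p': True, 'q': False}       # p and not q
np_nq = {'p': False, 'q': False}     # not p and not q

old_near = hamming(pq, p_nq)                            # 1 in the old language
old_far = hamming(pq, np_nq)                            # 2 in the old language
new_far = hamming(redescribe(pq), redescribe(np_nq))    # 1 in the new language

# The translation loses nothing: the four old models map to four distinct new ones.
all_models = [dict(zip(('p', 'q'), bits)) for bits in product([True, False], repeat=2)]
images = {tuple(sorted(redescribe(m).items())) for m in all_models}
```

The pair at distance 2 in the old language is at distance 1 in the new one, although the translation is a bijection on models; this is the lack of robustness noted above.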

There might be misunderstandings about the use of the word "distance" here. The authors are fully aware that inheritance 
networks cannot be captured by distance semantics in the sense of preferential structures. But we do NOT think here of 
distances from one fixed ideal point, but of relativized distances: Every prototype is the origin of measurements. E.g., the 
bird prototype is defined by "flying, laying eggs, having feathers, ...". So we presume that all birds have these properties 
of the prototype, i.e. distance 0 from the prototype. When we see that penguins do not fly, we move as little as possible 
from the bird prototype, so we give up "flying", but not the rest. Thus, penguins (better: the penguin prototype) will 
have distance 1 from the bird prototype (just one property has changed). So there is a new prototype for penguins, 
and considering penguins, we will not measure from the bird prototype, but from the penguin prototype, so the point 
of reference changes. This is exactly as in distance semantics for theory revision, introduced in [LMS01], only the point 
of reference is not the old theory T, but the old prototype, and the distance is a very special one, counting properties 
assumed to be independent. (The picture is a little bit more complicated, as the loss of one property (flying) may cause 
other modifications, but the simple picture suffices for this informal argument.) 
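
The simple picture of relativized distances can be sketched as follows. This is a toy illustration under stated assumptions: prototypes are encoded as dictionaries of independent Boolean properties, and the names `specialise` and `distance`, as well as the concrete property names, are hypothetical.

```python
def specialise(prototype, overrides):
    """Build a more specific prototype: keep everything, change only what is overridden."""
    new = dict(prototype)
    new.update(overrides)
    return new


def distance(p1, p2):
    """Count the properties on which two prototypes differ (properties assumed independent)."""
    keys = set(p1) | set(p2)
    return sum(p1.get(k) != p2.get(k) for k in keys)


bird = {'flying': True, 'laying eggs': True, 'having feathers': True}
# Penguins give up "flying", and nothing else.
penguin = specialise(bird, {'flying': False})

bird_to_penguin = distance(bird, penguin)
```

Here the penguin prototype keeps the egg and feather properties and sits at distance 1 from the bird prototype; further reasoning about penguins would then measure from `penguin`, not from `bird`.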



We conclude this Section with a remark on prototypes. 
not begin with the patient's most specific category (name and birthday or so), nor will he begin with all he knows about 
general objects. Therefore, it seems reasonable to investigate upward and downward reasoning here. 



9.5 Detailed translation of inheritance to modified systems of small sets 

For background material on abstract size semantics, the reader is referred to Chapter 3 (page 43). 



9.5.1 Normality 

As we saw already in Section 9.4.2 (page 185), normality in inheritance (and Reiter defaults etc.) is relative, and as much 
normality as possible is preserved. There is no set of absolutely normal cases of X, which we might denote N(X), but only, 
for φ, a set N(X, φ) of elements of X which behave normally with respect to φ. Moreover, N(X, φ) might be defined, but not 
N(X, ψ), for different φ and ψ. Normality in the sense of preferential structures is absolute: if x is not in N(X) (= μ(X) in 
preferential reading), we do not know anything beyond classical logic. This is the dark Swedes' problem: even dark Swedes 
should probably be tall. Inheritance systems are different: If birds usually lay eggs, then penguins, though abnormal with 
respect to flying, will still usually lay eggs. Penguins are fly-abnormal birds, but will continue to be egg-normal birds - 
unless we have again information to the contrary. So the absolute, simple N(X) of preferential structures splits up into 
many, by default independent, normalities. This corresponds to intuition: There are no absolutely normal birds, each one 
is particular in some sense, so ⋂{N(X, φ) : φ ∈ L} may well be empty, even if each single N(X, φ) is almost all birds. 
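
A toy computation shows how the intersection of the relative normalities can be empty although each N(X, φ) contains almost all of X. The sketch below is purely illustrative: the five birds, the property names, and the choice that bird number i is abnormal for the i-th property are assumptions made for the example.

```python
# Five birds, five properties; bird i is abnormal for the i-th property only.
birds = set(range(5))
properties = ['fly', 'lay eggs', 'feathers', 'sing', 'nest']

# N(X, phi): the birds that behave normally with respect to phi.
normal = {phi: birds - {i} for i, phi in enumerate(properties)}

# The absolutely normal birds would be those normal for every property.
absolutely_normal = set.intersection(*normal.values())
```

Each `normal[phi]` contains four of the five birds, yet `absolutely_normal` is empty: every bird is particular in exactly one respect.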

What are the laws of relative normality? N(X, φ) and N(X, ψ) will be largely independent (except for trivial situations, 
where φ <-> ψ, φ is a tautology, etc.). N(X, φ) might be defined, and N(X, ψ) not. Connections between the different 
normalities will be established only by valid paths. Thus, if there is no arrow, or no path, between X and Y, then N(X, Y) 
and N(Y, X) - where X, Y are also properties - need not be defined. This will get rid of the unwanted connections found 
with absolute normalities, as illustrated by Fact 9.5.1 (page 190). 

We interpret now "normal" by "big set", i.e. essentially "φ holds normally in X" iff "there is a big subset of X, where φ 
holds". This will, of course, be modified. 



9.5.2 Small sets 

The main interest of this Section is perhaps to show the adaptations of the concept of small and big subsets necessary for 
a more "real life" situation, where we have to relativize. The amount of changes illustrates the problems and what can be 
done, but also perhaps what should not be done, as the concept is stretched too far. For more background, see Chapter 3 
(page 43). 

As said, the usual informal way of speaking about inheritance networks (plus other considerations) motivates an 
interpretation by sets and soft set inclusion - A -> B means that "most A's are B's". Just as with normality, the "most" will 
have to be relativized, i.e. there is a B-normal part of A, and a B-abnormal one, and the first is B-bigger than the 
second - where "bigger" is relative to B, too. A further motivation for this set interpretation is the often evoked specificity 
argument for preclusion. Thus, we will now translate our remarks about normality into the language of big and small 
subsets. 

Consider now the system P (with Cumulativity), see Definition 2.3 (page 31). Recall from Remark 3.2.1 (page 45) that small
sets (see Definition 3.2.2.6 (page 46)) are used in two conceptually very distinct ways: $\alpha \mathrel{|\!\sim} \beta$ iff the set of $\alpha \wedge \neg\beta$-cases
is a small subset (in the absolute sense, there is just one system of big subsets of the $\alpha$-cases) of the set of $\alpha$-cases. The
second use is in information transfer, used in Cumulativity, or Cautious Monotony more precisely: if the set of $\alpha \wedge \neg\gamma$-cases
is a small subset of the set of $\alpha$-cases, then $\alpha \mathrel{|\!\sim} \beta$ carries over to $\alpha \wedge \gamma$: $\alpha \wedge \gamma \mathrel{|\!\sim} \beta$. (See also the discussion in [Sch04],
page 86, after Definition 2.3.6.) It is this transfer which we will consider here, and not things like AND, which connect
different $N(X,\phi)$ for different $\phi$.

Before we go into details, we will show that e.g. the system P is too strong to model inheritance systems, and that e.g.
the system R is too weak for this purpose. Thus, preferential systems are really quite different from inheritance systems.

Fact 9.5.1 

(a) System P is too strong to capture inheritance. 

(b) System R is too weak to capture inheritance. 



Proof 

(a) Consider the Tweety diagram, Diagram 9.2.1 (page 174): $c \to b \to d$, $c \not\to d$. There is no arrow $b \not\to c$, and we will
see that P forces one to be there. For this, we take the natural translation, i.e. $X \to Y$ will be "$X \cap Y$ is a big subset
of $X$", etc. We show that $c \cap b$ is a small subset of $b$, which we write $c \cap b < b$. $c \cap b = (c \cap b \cap d) \cup (c \cap b \cap \mathbf{C}d)$.
$c \cap b \cap \mathbf{C}d \subseteq b \cap \mathbf{C}d < b$, the latter by $b \to d$, thus $c \cap b \cap \mathbf{C}d < b$, essentially by Right Weakening. Set now $X := c \cap b \cap d$.
As $c \not\to d$, $X = c \cap b \cap d \subseteq c \cap d < c$, and by the same reasoning as above $X < c$. It remains to show $X < b$. We use now
$c \to b$. As $c \cap \mathbf{C}b < c$, and $c \cap X < c$, by Cumulativity $X = c \cap X \cap b < c \cap b$, so essentially by OR $X = c \cap X \cap b < b$.
Using the filter property, we see that $c \cap b < b$.

(b) Second, even R is too weak: In the diagram $X \to Y \to Z$, we want to conclude that most of $X$ is in $Z$, but as $X$ might
also be a small subset of $Y$, we cannot transfer the information "most Y's are in Z" to $X$.

□ 
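
Part (b) can be made concrete with finite sets. The sketch below is only an illustration under our own assumptions: "big" is approximated by a crude proportion threshold (the hypothetical helper `mostly`), which is not the abstract notion of size used in the text, and the sets are chosen by hand so that $X$ sits inside the small part of $Y$.

```python
def mostly(a, b, threshold=0.8):
    """Crude stand-in (an assumption) for 'a ∩ b is a big subset of a'."""
    return len(a & b) / len(a) >= threshold

Y = set(range(100))
Z = set(range(10, 100))   # 90 of the 100 elements of Y are in Z
X = set(range(5))         # X lies inside Y, but inside the small part Y - Z

x_mostly_y = mostly(X, Y)   # most X's are Y's
y_mostly_z = mostly(Y, Z)   # most Y's are Z's
x_mostly_z = mostly(X, Z)   # yet no X is a Z
```

Both premises hold, but the conclusion fails, because $X$ is itself a small subset of $Y$; nothing in an absolute size reading licenses the transfer from $Y$ to $X$.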
We have to distinguish direct information or arrows from inherited information or valid paths. In the language of big and
small sets, it is easiest to do this by two types of big subsets: big ones and very big ones. We will denote the first big, the
second BIG. This corresponds to the distinction between $a \Rightarrow b$ and $a \Rightarrow_{a} b$ in Definition 9.4.1 (page 188).

We will have the implications BIG $\to$ big and SMALL $\to$ small, so we have nested systems. Such systems were discussed
in [Sch95-1], see also [Sch97-2]. This distinction seems to be necessary to prevent arbitrary concatenation of valid paths
to valid paths, which would lead to contradictions. Consider e.g. $a \to b \to c \to d$, $a \to e \not\to d$, $e \to c$. Then concatenating
$a \to b$ with $b \to c \to d$, both valid, would lead to a simple contradiction with $a \to e \not\to d$, and not to preclusion, as it
should be - see below.

For the situation $X \to Y \to Z$, we will then conclude that:

If $Y \cap Z$ is a $Z$-BIG subset of $Y$ and $X \cap Y$ is a $Y$-big subset of $X$ then $X \cap Z$ is a $Z$-big subset of $X$. (We generalize
already to the case where there is a valid path from $X$ to $Y$.)

We call this procedure information transfer. 

$Y \to Z$ expresses the direct information in this context, so $Y \cap Z$ has to be a $Z$-BIG subset of $Y$. $X \to Y$ can be direct
information, but it is used here as channel of information flow, in particular it might be a composite valid path, so in our
context, $X \cap Y$ is a $Y$-big subset of $X$. $X \cap Z$ is a $Z$-big subset of $X$: this can only be big, and not BIG, as we have a
composite path.

The translation into big and small subsets and their modifications is now quite complicated: we seem to have to relativize, 
and we seem to need two types of big and small. This casts, of course, a doubt on the enterprise of translation. The future 
will tell if any of the ideas can be used in other contexts. 

We investigate this situation now in more detail, first without conflicts. 

The way we cut the problem is not the only possible one. We were guided by the idea that we should stay close to usual 
argumentation about big and small sets, should proceed carefully, i.e. step by step, and should take a general approach. 

Note that we start without any $X$-big subsets defined, so $X$ is not even an $X$-big subset of itself.

(A) The simple case of two arrows, and no conflicts. 

(In slight generalization:) If information $\phi$ is appended at $Y$, and $Y$ is accessible from $X$ (and there is no better information
about $\phi$ available), $\phi$ will be valid at $X$. For simplicity, suppose there is a direct positive link from $X$ to $Y$, written sloppily
$X \to Y \models \phi$. In the big subset reading, we will interpret this as: $Y \wedge \phi$ is a $\phi$-BIG subset of $Y$. It is important that this
is now direct information, so we have "BIG" and not "big". We read now $X \to Y$ also as: $X \cap Y$ is a $Y$-big subset of
$X$ - this is the channel, so just "big".

We want to conclude by transfer that $X \cap \phi$ is a $\phi$-big subset of $X$.

We do this in two steps: First, we conclude that $X \cap Y \cap \phi$ is a $\phi$-big subset of $X \cap Y$, and then, as $X \cap Y$ is a $Y$-big
subset of $X$, $X \cap \phi$ itself is a $\phi$-big subset of $X$. We do NOT conclude that $(X - Y) \cap \phi$ is a $\phi$-big subset of $X - Y$, this
is very important, as we want to preserve the reason of being $\phi$-big subsets - and this goes via $Y$! The transition from
"BIG" to "big" should be at the first step, where we conclude that $X \cap Y \cap \phi$ is a $\phi$-big (and not $\phi$-BIG) subset of $X \cap Y$,
as it is really here where things happen, i.e. transfer of information from $Y$ to arbitrary subsets $X \cap Y$.

We summarize the two steps in a slightly modified notation, corresponding to the diagram $X \to Y \to Z$:

(1) If $Y \cap Z$ is a $Z$-BIG subset of $Y$ (by $Y \to Z$), and $X \cap Y$ is a $Y$-big subset of $X$ (by $X \to Y$), then $X \cap Y \cap Z$ is a
$Z$-big subset of $X \cap Y$.

(2) If $X \cap Y \cap Z$ is a $Z$-big subset of $X \cap Y$, and $X \cap Y$ is a $Y$-big subset of $X$ (by $X \to Y$) again, then $X \cap Z$ is a $Z$-big
subset of $X$, so $X \cdots \to Z$.

Note that (1) is very different from Cumulativity or even Rational Monotony, as we do not say anything about X in 
comparison to Y : X need not be any big or medium size subset of Y. 

Seen as strict rules, this will not work, as it would result in transitivity, and thus monotony: we have to admit exceptions, 
as there might just be a negative arrow $X \not\to Z$ in the diagram. We will discuss such situations below in (C), where we
will modify our approach slightly, and obtain a clean analysis. 

(Here and in what follows, we are very cautious, and relativize all normalities. We could perhaps obtain our objective with 
a more daring approach, using absolute normality here and there. But this would be a purely technical trick (interesting 
in its own right), and we look here more for a conceptual analysis, and, as long as we do not find good conceptual reasons 
why to be absolute here and not there, we will just be relative everywhere.) 

We try now to give justifications for the two (defeasible) rules. They will be philosophical and can certainly be contested 
and/or improved. 

For (1): 

We look at $Y$. By $X \to Y$, $Y$'s information is accessible at $X$, so, as $Z$-BIG is defined for $Y$, $Z$-big will be defined for
$Y \cap X$. Moreover, there is a priori nothing which prevents $X$ from being independent from $Y$, i.e. $Y \cap X$ to behave like
$Y$ with respect to $Z$ - by default: of course, there could be a negative arrow $X \not\to Z$, which would prevent this. Thus, as
$Y \cap Z$ is a $Z$-BIG subset of $Y$, $Y \cap X \cap Z$ should be a $Z$-big subset of $Y \cap X$. By the same argument (independence), we
should also conclude that $(Y - X) \cap Z$ is a $Z$-big subset of $Y - X$. The definition of $Z$-big for $Y - X$ seems, however, less
clear.

To summarize, $Y \cap X$ and $Y - X$ behave by default with respect to $Z$ as $Y$ does, i.e. $Y \cap X \cap Z$ is a $Z$-big subset of $Y \cap X$
and $(Y - X) \cap Z$ is a $Z$-big subset of $Y - X$. The reasoning is downward, from supersets to subsets, and symmetrical to
$Y \cap X$ and $Y - X$. If the default is violated, we need a reason for it. This default is an assumption about the adequacy of
the language. Things do not change wildly from one concept to another (or, better: from $Y$ to $Y \wedge X$), they might change,
but then we are told so - by a corresponding negative link in the case of diagrams.

For (2): 

By $X \to Y$, $X$ and $Y$ are related, and we assume that $X$ behaves as $Y \cap X$ does with respect to $Z$. This is upward reasoning,
[...] aside (which can also be considered as being big and small in various, per default independent dimensions) this is close to
the reasoning with absolutely big and small sets: $X \cap Y - (X \cap Y \cap Z)$ is small in $X \cap Y$, so a fortiori small in $X$, and
$X - (X \cap Y)$ is small in $X$, so $(X - (X \cap Y)) \cup (X \cap Y - (X \cap Y \cap Z))$ is small in $X$ by the filter property, so $X \cap Y \cap Z$
is big in $X$, so a fortiori $X \cap Z$ is big in $X$.

Thus, in summary, we conclude by default that, 

(3) If $Y \cap Z$ is a $Z$-BIG subset of $Y$, and $X \cap Y$ is a $Y$-big subset of $X$, then $X \cap Z$ is a $Z$-big subset of $X$.

(B) The case with longer valid paths, but without conflicts. 

Treatment of longer paths: Suppose we have a valid composed path from $X$ to $Y$, $X \cdots \to Y$, and not any longer a direct
link $X \to Y$. By induction, i.e. upward chaining, we argue - using directly (3) - that $X \cap Y$ is a $Y$-big subset of $X$, and
conclude by (3) again that $X \cap Z$ is a $Z$-big subset of $X$.
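
The upward chaining with rule (3) in the conflict-free case can be sketched as a small fixpoint computation. This is an illustration under our own simplifying assumptions: sizes are tracked purely symbolically as pairs, the function name `transfer` is hypothetical, and negative arrows (treated in (C)) are ignored.

```python
def transfer(direct_arrows):
    """Close the direct positive arrows under rule (3), without conflicts.

    BIG holds pairs (Y, Z), read as 'Y ∩ Z is a Z-BIG subset of Y' (direct
    information); big holds pairs (X, Z), read as 'X ∩ Z is a Z-big subset of X'.
    """
    BIG = set(direct_arrows)
    big = set(BIG)                      # BIG implies big
    changed = True
    while changed:
        changed = False
        for (x, y) in list(big):        # channel X ... -> Y (possibly composite)
            for (y2, z) in BIG:         # direct information Y -> Z
                if y == y2 and (x, z) not in big:
                    big.add((x, z))     # rule (3): X ∩ Z is Z-big in X
                    changed = True
    return BIG, big

BIG, big = transfer({("a", "b"), ("b", "c"), ("c", "d")})
```

For the chain `a -> b -> c -> d`, the pair `("a", "d")` ends up in `big` but not in `BIG`: composite paths only ever yield "big", and the last step of each application is always a direct arrow.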

(C) Treatment of multiple and perhaps conflicting information. 
Consider Diagram 9.5.1 (page 192):

[Diagram 9.5.1: Multiple and conflicting information]



We want to analyze the situation and argue that e.g. X is mostly not in Z, etc. 

First, all arguments about X and Z go via the Y's. The arrows from X to the Y's, and from Y' to Y could also be 
valid paths. We look at information which concerns Z (thus U is not considered), and which is accessible (thus Y" is not 
considered). We can be slightly more general, and consider all possible combinations of accessible information, not only 
those used in the diagram by X. Instead of arguing on the level of X, we will argue one level above, on the Y's and their 
intersections, respecting specificity and unresolved conflicts. 

(Note that in more general situations, with arbitrary information appended, the problem is more complicated, as we have 
to check which information is relevant for some $\phi$ - conclusions can be arrived at by complicated means, just as in ordinary
logic. In such cases, it might be better first to look at all accessible information for a fixed X, then at the truth values and 
their relation, and calculate closure of the remaining information.) 

We then have (using the obvious language: "most A's are B's" for "$A \cap B$ is a big subset of $A$", and "MOST A's are B's"
for "$A \cap B$ is a BIG subset of $A$"):

In $Y$, $Y''$, and $Y \cap Y''$, we have that MOST cases are in $Z$. In $Y'$ and $Y \cap Y'$, we have that MOST cases are not in $Z$ (=
are in $\mathbf{C}Z$). In $Y' \cap Y''$ and $Y \cap Y' \cap Y''$, we are UNDECIDED about $Z$.

Thus:

$Y \cap Z$ will be a $Z$-BIG subset of $Y$, $Y'' \cap Z$ will be a $Z$-BIG subset of $Y''$, $Y \cap Y'' \cap Z$ will be a $Z$-BIG subset of $Y \cap Y''$.
$Y' \cap \mathbf{C}Z$ will be a $Z$-BIG subset of $Y'$, $Y \cap Y' \cap \mathbf{C}Z$ will be a $Z$-BIG subset of $Y \cap Y'$.

$Y' \cap Y'' \cap Z$ will be a $Z$-MEDIUM subset of $Y' \cap Y''$, $Y \cap Y' \cap Y'' \cap Z$ will be a $Z$-MEDIUM subset of $Y \cap Y' \cap Y''$.

This is just simple arithmetic of truth values, using specificity and unresolved conflicts, and the non-monotonicity is pushed 
into the fact that subsets need not preserve the properties of supersets. 
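
The "simple arithmetic of truth values" for the intersections can be sketched as follows. This is a deliberately simplified reading, stated as an assumption: a node is dropped from an intersection when a more specific node in the same intersection contradicts it, and the result is undecided if the survivors still disagree. The function name `verdict`, the encoding of the diagram, and the string labels are ours; the general principle (P2.2) of the text is not implemented here.

```python
from itertools import combinations

def verdict(subset, sign, more_specific):
    """Return '+', '-' or 'undecided' for the intersection of the nodes in subset."""
    # A node is precluded if a more specific node in the subset contradicts it.
    survivors = [n for n in subset
                 if not any((m, n) in more_specific and sign[m] != sign[n]
                            for m in subset)]
    signs = {sign[n] for n in survivors}
    return signs.pop() if len(signs) == 1 else "undecided"

# Assumed reading of the diagram: Y -> Z, Y'' -> Z, Y' -/-> Z, and Y' -> Y.
sign = {"Y": "+", "Y'": "-", "Y''": "+"}
more_specific = {("Y'", "Y")}          # Y' is more specific than Y

table = {frozenset(s): verdict(s, sign, more_specific)
         for r in (1, 2, 3) for s in combinations(sign, r)}
```

With this encoding, `table` reproduces the values listed above: `+` for $Y$, $Y''$ and $Y \cap Y''$, `-` for $Y'$ and $Y \cap Y'$, and `undecided` for $Y' \cap Y''$ and $Y \cap Y' \cap Y''$.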

In more complicated situations, we implement e.g. the general principle (P2.2) from Definition 9.2.2 (page 177), to calculate [...]

This will result in the "correct" truth value for the intersections, i.e. the one corresponding to the other approaches.

It remains to do two things: (C.1) We have to assure that $X$ "sees" the correct information, i.e. the correct intersection,
and, (C.2), that $X$ "sees" the accepted $Y$'s, i.e. those through which valid paths go, in order to construct not only the
result, but also the correct paths.

(Note that by split validity preclusion, if there is a valid path from $A$ through $B$ to $C$, $\sigma : A \cdots \to B$, $B \to C$, and
$\sigma' : A \cdots \to B$ is another valid path from $A$ to $B$, then $\sigma' \circ B \to C$ will also be a valid path. Proof: If not, then $\sigma' \circ B \to C$
is precluded, but the same preclusion will also preclude $\sigma \circ B \to C$ by split validity preclusion, or it is contradicted, and
a similar argument applies again. This is the same argument as the one for the simplified definition of preclusion - see
the remark on that definition, (4).)

(C.1) Finding and inheriting the correct information:

$X$ has access to $Z$-information from $Y$ and $Y'$, so we have to consider them. Most of $X$ is in $Y$, most of $X$ is in $Y'$, i.e.
$X \cap Y$ is a $Y$-big subset of $X$, $X \cap Y'$ is a $Y'$-big subset of $X$, so $X \cap Y \cap Y'$ is a $Y \cap Y'$-big subset of $X$, thus most of
$X$ is in $Y \cap Y'$.

We thus have $Y$, $Y'$, and $Y \cap Y'$ as possible reference classes, and use specificity to choose $Y \cap Y'$ as reference class. We
do not know anything e.g. about $Y \cap Y' \cap Y''$, so this is not a possible reference class.

Thus, we use specificity twice, on the $Y$'s-level (to decide that $Y \cap Y'$ is mostly not in $Z$), and on $X$'s-level (the choice
of the reference class), but this is good policy, as, after all, much of nonmonotonicity is about specificity.

We should emphasize that nonmonotonicity lies in the behaviour of the subsets, determined by truth values and comparisons 
thereof, and the choice of the reference class by specificity. But both are straightforward now and local procedures, using 
information already decided before. There is no complicated issue here like determining extensions etc. 

We now use the above argument, described in the simple case, but with more detail, speaking in particular about the most
specific reference class for information about $Z$, $Y \cap Y'$ in our example - this is used essentially in (1.4), where the "real"
information transfer happens, and where we go from BIG to big.

(1.1) By $X \to Y$ and $X \to Y'$ (and there are no other $Z$-relevant information sources), we have to consider $Y \cap Y'$ as
reference class.

(1.2) $X \cap Y$ is a $Y$-big subset of $X$ (by $X \to Y$) (it is even $Y$-BIG, but we are immediately more general to treat valid
paths), $X \cap Y'$ is a $Y'$-big subset of $X$ (by $X \to Y'$). So $X \cap Y \cap Y'$ is a $Y \cap Y'$-big subset of $X$.

(1.3) $Y \cap Z$ is a $Z$-BIG subset of $Y$ (by $Y \to Z$), $Y' \cap \mathbf{C}Z$ is a $Z$-BIG subset of $Y'$ (by $Y' \not\to Z$), so by preclusion
$Y \cap Y' \cap \mathbf{C}Z$ is a $Z$-BIG subset of $Y \cap Y'$.

(1.4) $Y \cap Y' \cap \mathbf{C}Z$ is a $Z$-BIG subset of $Y \cap Y'$, and $X \cap Y \cap Y'$ is a $Y \cap Y'$-big subset of $X$, so $X \cap Y \cap Y' \cap \mathbf{C}Z$ is a
$Z$-big subset of $X \cap Y \cap Y'$.

This cannot be a strict rule without the reference class, as it would then apply to $Y \cap Z$, too, leading to a contradiction.

(2) If $X \cap Y \cap Y' \cap \mathbf{C}Z$ is a $Z$-big subset of $X \cap Y \cap Y'$, and $X \cap Y \cap Y'$ is a $Y \cap Y'$-big subset of $X$, then $X \cap \mathbf{C}Z$ is a
$Z$-big subset of $X$.

We make this now more formal. 

We define for all nodes $X$, $Y$ two sets: $B(X,Y)$, and $b(X,Y)$, where $B(X,Y)$ is the set of $Y$-BIG subsets of $X$, and
$b(X,Y)$ is the set of $Y$-big subsets of $X$. (To distinguish undefined from medium/MEDIUM-size, we will also have to
define $M(X,Y)$ and $m(X,Y)$, but we omit this here for simplicity.)

The translations are then: 

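(1.2') $X \cap Y \in b(X,Y)$ and $X \cap Y' \in b(X,Y')$ $\Rightarrow$ $X \cap Y \cap Y' \in b(X, Y \cap Y')$

(1.3') $Y \cap Z \in B(Y,Z)$ and $Y' \cap \mathbf{C}Z \in B(Y',Z)$ $\Rightarrow$ $Y \cap Y' \cap \mathbf{C}Z \in B(Y \cap Y', Z)$ by preclusion

(1.4') $Y \cap Y' \cap \mathbf{C}Z \in B(Y \cap Y', Z)$ and $X \cap Y \cap Y' \in b(X, Y \cap Y')$ $\Rightarrow$ $X \cap Y \cap Y' \cap \mathbf{C}Z \in b(X \cap Y \cap Y', Z)$ as $Y \cap Y'$ is
the most specific reference class

(2') $X \cap Y \cap Y' \cap \mathbf{C}Z \in b(X \cap Y \cap Y', Z)$ and $X \cap Y \cap Y' \in b(X, Y \cap Y')$ $\Rightarrow$ $X \cap \mathbf{C}Z \in b(X,Z)$.

Finally:

(3') $A \in B(X,Y) \to A \in b(X,Y)$ etc.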

Note that we used, in addition to the set rules, preclusion, and the correct choice of the reference class. 
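
The rules (1.2') to (2') and (3') can be traced mechanically for the running example. The following is a minimal sketch under our own assumptions: sets are handled purely symbolically (a frozenset of atom names stands for the intersection of those atoms, with `"CZ"` standing for the complement of $Z$), the helper `cap` is hypothetical, and preclusion in (1.3') is simply taken for granted because $Y'$ is assumed more specific than $Y$.

```python
def cap(*sets):
    """Symbolic intersection: a set of atom names stands for their intersection."""
    return frozenset().union(*sets)

X, Y, Y1, Z, CZ = (frozenset({name}) for name in ("X", "Y", "Y'", "Z", "CZ"))

# A triple (A, S, R) in BIG (resp. big) is read as A ∈ B(S, R) (resp. A ∈ b(S, R)).
# Base theory for the arrows X -> Y, X -> Y', Y -> Z, Y' -/-> Z.
BIG = {(cap(X, Y), X, Y), (cap(X, Y1), X, Y1),
       (cap(Y, Z), Y, Z), (cap(Y1, CZ), Y1, Z)}
big = set(BIG)                                    # (3'): BIG implies big

ref = cap(Y, Y1)                                  # reference class Y ∩ Y'

# (1.2'): combine the two channels from X.
if (cap(X, Y), X, Y) in big and (cap(X, Y1), X, Y1) in big:
    big.add((cap(X, ref), X, ref))

# (1.3'): preclusion, assuming Y' is more specific than Y (arrow Y' -> Y).
if (cap(Y, Z), Y, Z) in BIG and (cap(Y1, CZ), Y1, Z) in BIG:
    BIG.add((cap(ref, CZ), ref, Z))

# (1.4'): transfer from the most specific reference class; BIG becomes big.
if (cap(ref, CZ), ref, Z) in BIG and (cap(X, ref), X, ref) in big:
    big.add((cap(X, ref, CZ), cap(X, ref), Z))

# (2'): upward step from X ∩ Y ∩ Y' to X.
if (cap(X, ref, CZ), cap(X, ref), Z) in big and (cap(X, ref), X, ref) in big:
    big.add((cap(X, CZ), X, Z))

big |= BIG                                        # (3') again for the new BIG fact
```

After these steps `big` contains the triple for $X \cap \mathbf{C}Z \in b(X,Z)$, while `BIG` does not: the conclusion about $X$ is inherited, not direct.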

(C.2) Finding the correct paths: 

Idea: 

(1) If we come to no conclusion, then no path is valid, this is trivial. 

(2) If we have a conclusion: 

(2.1) All contradictory paths are out: e.g. $Y \cap Z$ will be $Z$-big, but $Y \cap Y' \cap \mathbf{C}Z$ will be $Z$-big. So there is no valid path
via $Y$.

(2.2) Thus, not all paths supporting the same conclusion are valid. 
Consider the following Diagram 9.5.2 (page 193):

[Diagram 9.5.2: Valid paths vs. valid conclusions]



There might be a positive path through $Y$, a negative one through $Y'$, a positive one through $Y''$ again, with $Y'' \to Y' \to Y$,
so $Y$ will be out, and only $Y''$ in. We can see this, as there is a subset, $\{Y, Y'\}$, which shows a change: $Y \cap Z$ is $Z$-BIG,
$Y' \cap \mathbf{C}Z$ is $Z$-BIG, $Y'' \cap Z$ is $Z$-BIG, and $Y \cap Y' \cap \mathbf{C}Z$ is $Z$-BIG, and the latter can only happen if there is a preclusion
between $Y'$ and $Y$, where $Y$ loses. Thus, we can see this situation by looking only at the sets.

We show now equivalence with the inheritance formalism given in Definition 9.2.3 (page 178).

Fact 9.5.2

The above definition and the one outlined in Definition 9.2.3 (page 178) correspond.

Proof

By induction on the length of the deduction that $X \cap Z$ (or $X \cap \mathbf{C}Z$) is a $Z$-big subset of $X$. (Outline)

It is a corollary of the proof that we have to consider only subpaths and information of all generalized paths between $X$
and $Z$.

Make all sets (i.e. one for every node) sufficiently different, i.e. all sets and boolean combinations of sets differ by infinitely
many elements, e.g. $A \cap B \cap C$ will have infinitely many fewer elements than $A \cap B$, etc. (Infinite is far too many, we just
choose it by laziness to have room for the $B(X,Y)$ and the $b(X,Y)$.)

Put in $X \cap Y \in B(X,Y)$ for all $X \to Y$, and $X \cap \mathbf{C}Y \in B(X,Y)$ for all $X \not\to Y$ as base theory.

Length = 1: Then big must be BIG, and, if $X \cap Z$ is a $Z$-BIG subset of $X$, then $X \to Z$, likewise for $X \cap \mathbf{C}Z$.

We stay close now to above Diagram 9.5.1 (page 192), so we argue for the negative case.

Suppose that we have deduced $X \cap \mathbf{C}Z \in b(X,Z)$, we show that there must be a valid negative path from $X$ to $Z$. (The
other direction is easy.)

Suppose for simplicity that there is no negative link from $X$ to $Z$ - otherwise we are finished.

As we can distinguish intersections from elementary sets (by the starting hypothesis about sizes), this can only be deduced
using (2'). So there must be some suitable $\{Y_i : i \in I\}$ and we must have deduced $X \cap \bigcap Y_i \in b(X, \bigcap Y_i)$, the second
hypothesis of (2'). If $I$ is a singleton, then we have the induction hypothesis, so there is a valid path from $X$ to $Y$. So
suppose $I$ is not a singleton. Then the deduction of $X \cap \bigcap Y_i \in b(X, \bigcap Y_i)$ can only be done by (1.2'), as this is the only
rule having in the conclusion an elementary set on the left in $b(.,.)$, and a true intersection on the right. Going back along
(1.2'), we find $X \cap Y_i \in b(X, Y_i)$, and by the induction hypothesis, there are valid paths from $X$ to the $Y_i$.

The first hypothesis of (2'), $X \cap \bigcap Y_i \cap \mathbf{C}Z \in b(X \cap \bigcap Y_i, Z)$, can be obtained by (1.3') or (1.4'). If it was obtained by (1.3'),
then $X$ is one of the $Y_i$, but then there is a direct link from $X$ to $Z$ (due to the "B", BIG). As a direct link always wins
by specificity, the link must be negative, and we have a valid negative path from $X$ to $Z$. If it was obtained by (1.4'), then
its first hypothesis $\bigcap Y_i \cap \mathbf{C}Z \in B(\bigcap Y_i, Z)$ must have been deduced, which can only be by (1.3'), but the set of $Y_i$ there
was chosen to take all $Y_i$ into account for which there is a valid path from $X$ to $Y_i$ and arrows from the $Y_i$ to $Z$ (the rule
was only present for the most specific reference class with respect to $X$ and $Z$!), and we are done by the definition of valid
paths in Section 9.2 (page 173).

□ 
We summarize our ingredients.

Inheritance was done essentially by (1) and (2) of (A) above and its elaborations (1.i), (2) and (1.i'), (2'). It consisted of
a mixture of bold and careful (in comparison to systems P and R) manipulation of big subsets. We had to be bolder than
the systems P and R are, as we have to transfer information also to perhaps small subsets. We had to be more careful,
as P and R would have introduced far more connections than are present. We also saw that we are forced to lose the
paradise of absolute small and big subsets, and have to work with relative size.

We then have a plug-in decision what to do with contradictions. This is a plug-in, as it is one (among many possible) 
solutions to the much more general question of how to deal with contradictory information, in the presence of a (partial, 
not necessarily transitive) relation which compares strength. At the same place of our procedure, we can plug in other 
solutions, so our approach is truly modular in this aspect. The comparing relation is defined by the existence of valid 
paths, i.e. by specificity. 

This decision is inherited downward using again the specificity criterion. 

Perhaps the deepest part of the analysis can be described as follows: Relevance is coded by positive arrows, and valid 
positive paths, and thus is similar to Kripke structures for modality, where the arrows code dependencies between the 
situations for the evaluation of the modal quantifiers. In our picture, information at A can become relevant only to node 
B iff there is a valid positive path from B to A. But, relevance (in this reading, which is closely related to causation) 
is profoundly non-monotonic, and any purely monotonic treatment of relevance would be insufficient. This seems to 
correspond to intuition. Relevance is then expressed in our translation to small and big sets formally by the possibility of 
combining different small and big sets in information transfer. This is, of course, a special form of relevance, there might 
be other forms. 
Chapter 10

Argumentation 



10.1 Blocking paths 

We have two kinds of arrows, $a \to b$ (positive) and $a \not\to b$ (negative). The basic idea is that one negative argument blocks all positive
ones. (We will see later that this is not a restriction.)

This is a generalization over just ending the relation $\to$ at a point, as some $x$ might be accessible from $a$, but not from
$\{a, b\}$, as $b$ introduces a blocking argument - thus it becomes non-monotonic. Of course, the blocking might itself be
blocked by a new $c$ in $\{a, b, c\}$, etc.

E.g. $a \to b \not\to c$ will block $a \to c$.

The question is which nodes are visible from a (set of) node(s), we denote this set $\overline{a}$ or $\overline{A}$ (in the case of a set). In above
example, $b \in \overline{a}$, but $c \notin \overline{a}$. Intuitively, we can see this as a relative horizon.

We will assume that there are no cycles, and that the networks are finite. 

Suppose $a \not\to b$ is in a network, then we can look at $\{a, b\}$, and we then force $b$ to be valid, despite $a \not\to b$. "Outside forcing"
overrides negative arrows.
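
One way to make the visible set concrete is the following sketch. It encodes our own reading of the informal description, stated here as an assumption: a node is visible from $A$ if it belongs to $A$ (outside forcing), or if some visible node supports it by a positive arrow and no visible node attacks it by a negative arrow. The function name `visible` and the encoding of arrows as pairs are hypothetical; the network is assumed finite and acyclic, as in the text.

```python
def visible(A, pos, neg):
    """Nodes visible from the set A in a finite acyclic network.

    pos and neg are sets of pairs (source, target) for positive and negative arrows.
    """
    A = set(A)
    nodes = A | {n for arrow in pos | neg for n in arrow}
    memo = {}

    def vis(x):
        if x not in memo:
            supported = any(vis(y) for (y, t) in pos if t == x)
            blocked = any(vis(y) for (y, t) in neg if t == x)
            # Members of A are forced in; otherwise one visible attacker
            # blocks all positive support.
            memo[x] = x in A or (supported and not blocked)
        return memo[x]

    return {x for x in nodes if vis(x)}

# The small example above: a -> b -/-> c blocks a -> c.
pos = {("a", "b"), ("a", "c")}
neg = {("b", "c")}
from_a = visible({"a"}, pos, neg)
from_a_and_c = visible({"a", "c"}, pos, neg)
```

Under this reading `from_a` is `{"a", "b"}`, so `c` is beyond the horizon of `a`, while `from_a_and_c` contains `c` by outside forcing.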

Fact 10.1.1 

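(1) A version of Cumulativity holds: (Cum) $A \subseteq B \subseteq \overline{A} \Rightarrow \overline{B} = \overline{A}$

(2) If $x \in \overline{A}$, but $x \notin \overline{A \cup \{a\}}$, then $a \not\to x$, or there is some new $a' \in \overline{A \cup \{a\}} - \overline{A}$.

(3) $x \notin \overline{A}$, $x \in \overline{A \cup \{a\}}$ $\not\Rightarrow$ $x \in \overline{a}$ ($x$ might just have destroyed a counterargument).

(4) $x \in \overline{A}$, $x \in \overline{B}$ $\Rightarrow$ $x \in \overline{A \cup B}$?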

Proof 

□ 



There are problems of ambiguity. 
Example 10.1.1 

Consider $a \to b$, $c$, and then add (1) nothing, (2) $a \not\to c$, (3) $b \not\to c$, (4) $a \not\to c$ and $b \not\to c$, (5) $b \not\to c$ and $a \to c$. They cannot
be distinguished - $b \in \overline{a}$ will always be the case, but neither $a$ nor $b$ "lead" to anything else, as $a \to c$ is destroyed by
$a \to b \not\to c$.

Of course, giving more information, e.g. some $d \to b$ and $d \to c$ allows us to see if $b \not\to c$ is present. Thus, giving some
special nodes to "read out" information and "feed in" information can disambiguate.

A more serious problem is the following: 
Example 10.1.2 

$b \to c \not\to x$, $b \not\to d \to x$, $a \not\to c$, $a \to d$ are the base arrows. Adding at least one of $a \to x$ or $b \to x$ will give the same
information - irrespective of whether we add just one or both. But the case without adding any of the two is different.
This is easily seen by examining the cases.

For $a$, $a \to x$ is not necessary, as $a \to d \to x$ is valid. For $b$, $b \to c \not\to x$ will destroy $b \to x$. For $\{a, b\}$: $a \not\to c$ blocks $c$, so
$c \not\to x$ is not valid, and $x$ is free. But $b \not\to d$ blocks $d \to x$, so $x$ is not accessible via $d$, so we need at least one of $a \to x$ or
$b \to x$.
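
The case analysis of this example can be checked exhaustively over all starting sets $A$. The sketch below is self-contained and rests on an assumed reading of visibility (members of $A$ are forced in; otherwise a node needs a visible positive supporter and no visible negative attacker). The helper names `visible` and `profile` are hypothetical.

```python
from itertools import chain, combinations

def visible(A, pos, neg):
    """Visible nodes from A (assumed reading; finite acyclic network)."""
    A = set(A)
    nodes = A | set(chain.from_iterable(pos | neg))
    memo = {}

    def vis(x):
        if x not in memo:
            memo[x] = x in A or (
                any(vis(y) for (y, t) in pos if t == x)
                and not any(vis(y) for (y, t) in neg if t == x))
        return memo[x]

    return frozenset(x for x in nodes if vis(x))

# Base arrows of the example.
pos = {("b", "c"), ("d", "x"), ("a", "d")}
neg = {("c", "x"), ("b", "d"), ("a", "c")}
names = "abcdx"
subsets = [set(s) for r in range(len(names) + 1) for s in combinations(names, r)]

def profile(extra):
    """Visible sets for every starting set A, after adding the extra positive arrows."""
    return [visible(A, pos | extra, neg) for A in subsets]

none, only_a, only_b, both = (
    profile(e) for e in (set(), {("a", "x")}, {("b", "x")}, {("a", "x"), ("b", "x")}))
```

Under this reading the three profiles `only_a`, `only_b` and `both` coincide for all 32 starting sets, while `none` differs (for instance at $A = \{a, b\}$, where $x$ is not visible without one of the added arrows).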
10.1.0.0.19 New results (5.4.08)

We will write now $X + x$ for $X \cup \{x\}$ etc.

Fact 10.1.2

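(1) Let $a \to x$, $b \to y$ be such that

(a) for all $A$ either

(1.1) $\overline{A} = \overline{A + a \to x} = \overline{A + b \to y} = \overline{A + a \to x + b \to y}$ or

(1.2) $\overline{A} \neq \overline{A + a \to x} = \overline{A + b \to y} = \overline{A + a \to x + b \to y}$

(b) there is at least one $A$ with property (1.2).

Then $x = y$.

(2) Let $a \to x$, $b \to y$ be such that

(a) for all $A$ either

(1.1) $\overline{A} = \overline{A + a \to x} = \overline{A + b \to y} = \overline{A + a \to x + b \to y}$ or

(1.2') $\overline{A} = \overline{A + a \to x} = \overline{A + b \to y} \neq \overline{A + a \to x + b \to y}$

(b) there is at least one $A$ with property (1.2').

Then $x = y$.

(3) Situation (2) is impossible.

(4) Let $\overline{A} \neq \overline{A + a \to x} = \overline{A + b \to y}$. Then $x = y$.

(5) Let $\overline{A} \neq \overline{A + a \to x} = \overline{A + b \to x}$. Then $\overline{A + a \to x} = \overline{A + b \to x} = \overline{A + a \to x + b \to x}$.

Analogous properties hold for negative arrows:

(6) Let $\overline{A} \neq \overline{A + a \not\to x} = \overline{A + b \not\to y}$. Then $x = y$.

(7) Let $\overline{A} \neq \overline{A + a \not\to x} = \overline{A + b \not\to x}$. Then $\overline{A + a \not\to x} = \overline{A + b \not\to x} = \overline{A + a \not\to x + b \not\to x}$.

(8) Let $\overline{A} = \overline{A + a \not\to x} = \overline{A + b \not\to y} \neq \overline{A + a \not\to x + b \not\to y}$, then $x = y$.

(9) $\overline{A} = \overline{A + a \not\to x} = \overline{A + b \not\to x} \neq \overline{A + a \not\to x + b \not\to x}$ is impossible.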

Proof 

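(1) Let $A$ be with property (1.2). As $\overline{A} \neq \overline{A + a \to x}$, $a \to x$ has an effect, this can only be because $x \notin \overline{A}$, $x \in
\overline{A + a \to x}$. Analogously $y \notin \overline{A}$, $y \in \overline{A + b \to y}$. Thus $x, y \notin \overline{A}$, $x, y \in \overline{A + a \to x} = \overline{A + b \to y} = \overline{A + a \to x + b \to y}$. Then
$\overline{A + a \to x} = \overline{A + x}$, and $y \notin \overline{A}$, $y \in \overline{A + x}$, thus $x$ is before $y$ (not necessarily $x \to y$ or so, maybe $x \not\to z \not\to y$ etc.).
Analogously $y$ is before $x$, as the diagram is free from cycles, $x = y$.

(2) If $x, y \in \overline{A}$, then $\overline{A} = \overline{A + a \to x + b \to y}$. If $x \in \overline{A}$, then $\overline{A + b \to y} = \overline{A + a \to x + b \to y}$, as $x$ is already present,
contradiction. Analogously for $y$, so $x, y \notin \overline{A}$. As $\overline{A + a \to x} \neq \overline{A + a \to x + b \to y}$, $y \in \overline{A + a \to x + b \to y}$ (otherwise,
$+ b \to y$ would have no consequences). Analogously for $x$, thus $x, y \in \overline{A + a \to x + b \to y}$. As $x \notin \overline{A + a \to x}$, but
$x \in \overline{A + a \to x + b \to y}$, $y$ has to be before $x$, analogously $x$ before $y$, so again $x = y$.

(3) Thus $\overline{A} = \overline{A + a \to x} = \overline{A + b \to x} \neq \overline{A + a \to x + b \to x}$. But e.g. $\overline{A + b \to x} \neq \overline{A + a \to x + b \to x}$ is impossible, as
the number of supporting arguments is unimportant.

(4) The proof of (1) uses only the new prerequisites.

(5) The number of arguments is unimportant. So $\overline{A + a \to x} = \overline{A + b \to x} = \overline{A + a \to x + b \to x} = \overline{A + x}$.

(6) $a \not\to x$ has effect, so $x \in \overline{A}$, $x \notin \overline{A + a \not\to x}$, analogously for $y \in \overline{A}$, $y \notin \overline{A + b \not\to y}$. Thus, again, $x$ is before $y$, $y$ before
$x$, so $x = y$.

(7) Again, the number of arguments is unimportant.

(8) If $x \notin \overline{A}$, then $\overline{A + b \not\to y} = \overline{A + a \not\to x + b \not\to y}$, contradiction. Thus, $x, y \in \overline{A}$. As $\overline{A + a \not\to x} \neq \overline{A + a \not\to x + b \not\to y}$,
$y \notin \overline{A + a \not\to x + b \not\to y}$, otherwise, $b \not\to y$ would have no consequences. Analogously for $x$, so $x, y \notin \overline{A + a \not\to x + b \not\to y}$.
As $x \in \overline{A + a \not\to x}$, but $x \notin \overline{A + a \not\to x + b \not\to y}$, $x$ has to be before $y$, and vice versa, so $x = y$.

(9) $x \in \overline{A} = \overline{A + a \not\to x} = \overline{A + b \not\to x}$, so $a, b \notin \overline{A}$, and $x \in \overline{A + a \not\to x + b \not\to x}$, contradiction.

□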



Fact 10.1.3 

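(1) $\overline{A} \neq \overline{A + a \to x}$ (meaning: same $A$, but added to graph $a \to x$) $\to$ (1.1) $a \in \overline{A}$ (1.2) $x \notin \overline{A}$, $x \in \overline{A + a \to x}$ (1.3) all $z$
s.t. $z \in \overline{A} \wedge z \notin \overline{A + a \to x}$ are downstream from $x$ (1.4) $\overline{A + a \to x} = \overline{A \cup \{x\}}$ (meaning: graph unchanged, but added
$x$ to $A$)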

fO\ ~A U- A 1 „ T -- ^ , (1 1\ „ r 7 f1 11 - rl ^ H A I „ T -- ^ M o> „11 » „ i „ ^ ~~A . . „ ^/ /I I „ T -- ^ ^„ T „o+.™ m 
(3) If $x \in \overline{A}$, then $\overline{A} = \overline{A + a \to x}$. If $x, y \in \overline{A}$, then $\overline{A} = \overline{A + a \to x + b \to y}$.



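New proof for (2) in above Fact 10.1.2: As $\overline{A + b \to y} \neq \overline{A + b \to y + a \to x}$, by (1.2) above $x \notin \overline{A} = \overline{A + b \to y}$, and
$x \in \overline{A + a \to x + b \to y}$. Analogously, $y \notin \overline{A}$, $y \in \overline{A + a \to x + b \to y}$. So $x, y \notin \overline{A}$, $x, y \in \overline{A + a \to x + b \to y}$. The rest of
the argument holds by (1.3) above.

New proof for (3) in above Fact 10.1.2: $x \notin \overline{A} = \overline{A + a \to x}$ $\to$ $a \notin \overline{A}$ or ex. $c \in \overline{A}$, $c \not\to x$. $x \in \overline{A + a \to x + b \to x}$, so
$b \in \overline{A + a \to x}$, so $b \in \overline{A}$, as $b$ is upstream from $x$, and there is no $c \in \overline{A}$, $c \not\to x$. Analogously for $a \in \overline{A}$.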